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NEW NUCLEOTIDE SEQUENCES 



PCT/SE98/01947 



TECHNICAL FIELD 

5 The present invention relates in particular to nucleic acid molecules encoding GABA B 
receptors, and to methods for screening for compounds which are inhibitors of transient 
lower esophageal sphincter relaxations (TLESR), said methods comprising the use of a 
nucleic acid molecule encoding a GABA B receptor. 

10 

BACKGROUND ART 
GABAg receptors 

15 GABA (4-aminobutanoic acid) is an endogenous neurotransmitter in the central and 
peripheral nervous systems. Receptors for GABA have traditionally been divided into 
GABA A and GABA B receptor subtypes. GABAg receptors (for a review see Kerr, D.LB. 
and Ong, J. (1995) Pharmac. Ther. vol. 67, pp. 187-246) belong to the superfamily of G- 
protein coupled receptors. GABAg receptor agonists are described as being of use in the 

20 treatment of CNS disorders, such as muscle relaxation in spinal spasticity, cardiovascular 
disorders, asthma, gut motility disorders such as irritable bowel syndrome and as prokinetic 
and anti-tussive agents. GABAg receptor agonists have also been disclosed as useful in the 
treatment of emesis (WO 96/1 1680). 

25 The cloning of the rat GABAg receptors GAB A B R1 a (SEQ ID NOS: 44 and 45) and 
GABA B Rlb (SEQ ID NOS: 46 and 47) is disclosed by Kaupmann et al. (1997) Nature, 
vol. 386, 239-246. The mature rat GABAgRlb differed from GABA B Rla in that the N- 
terminal 147 residues were replaced by 18 different residues. It was presumed that the rat 
GAB A B R1 a and -b receptor variants are derived from the same gene by alternative 

30 splicing. 
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The cloning of the human GABA B receptor GABA B Rlb is disclosed in WO 97/46675. 

Reflux 

5 In some humans, the lower esophageal sphincter (LES) is prone to relaxing more 

frequently than in other humans. As a consequence, fluid from the stomach can pass into 
the esophagus since the mechanical barrier is temporarily lost at such times, an event" 
hereinafter referred to as "reflux". 

10 Gastroesophageal reflux disease (GERD) is the most prevalent upper gastrointestinal tract 
disease. Current therapy has aimed at reducing gastric acid secretion, or by reducing 
esophageal acid exposure by enhancing esophageal clearance, lower esophageal sphincter 
tone and gastric emptying. The major mechanism behind reflux has been considered to 
depend on a hypotonic lower esophageal sphincter. However, recent research (e.g. 

15 Holloway & Dent (1990) Gastroenterol. Clin. N. Amer. 19, 517-535) has shown that most 
reflux episodes occur during transient lower esophageal sphincter relaxations (TLESR), i.e. 
relaxations not triggered by swallows. It has also been shown that gastric acid secretion 
usually is normal in patients with GERD. Consequently, there is a need for compounds 
which reduce the incidence of TLESR and thereby prevent reflux. 

20 

DISCLOSURE OF THE INVENTION 

In the applicants' earlier patent application WO 98/1 1885, filed on 15 September 1997, it is 
25 disclosed that GABA B receptor agonists can be used to reduce the incidence of transient 
lower esophageal sphincter relaxations (TLESR). 

The present invention provides nucleic acid molecules encoding human and canine 
GAB A B receptors. These nucleic acid molecules will make possible the screening for 
30 compounds which are agonists or antagonists of GABAg receptors, e.g. compounds which 
are inhibitors of transient lower esophageal sphincter relaxations (TLESR). 
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Consequently, in a first aspect, the present invention provides a nucleic acid molecule 
encoding a GABA B receptor, or a functionally equivalent modified form thereof, said 
receptor being selected from the group consisting of human and canine GAB A B receptors. 

In preferred forms of the invention, the said nucleic acid molecule encodes the human 
GABA B receptor la (SEQ ED NOS: 48 and 49), lb (SEQ ID NOS: 50 and 5 1), lc (SEQ ID 
NOS: 54 and 55) or Id (SEQ ID NOS: 56 and 57); or the canine GABA B receptor la (SEQ 
ID NOS: 52 and 53) or lc (SEQ ID NOS: 58 and 59). Accordingly, the invention 
furthermore provides a nucleic acid molecule selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 48, 
50, 52, 54, 56 or 58; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, under 
stringent hybridization conditions, to a nucleotide sequence complementary to the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a result 
of the genetic code to a nucleotide sequence as defined in (a) or (b). 

Furthermore, the invention provides nucleic acid molecules of genomic origin encoding 
human GABA B receptors (SEQ ID NOS: 60 and 61) as well as nucleic acid molecules (set 
forth as SEQ ED NO: 70, 72, 74, 76, 78, 80, 82, 84) encoding additional isoforms of the 
human GABA B receptor which isoforms can be shown to be generated by alternative 
splicing. 

It should thus be understood that the nucleic acid molecule according to the invention is 
not to be limited strictly to molecules comprising the sequences set forth as SEQ ID : 48, 
50, 52, 54, 56 or 58. Rather the invention encompasses nucleic acid molecules carrying 
modifications like substitutions, small deletions, insertions or inversions, which 
nevertheless encode proteins having substantially the biochemical activity of the GABA B 
receptors according to the invention. Included in the invention are consequently nucleic 
acid molecules, the nucleotide sequence of which is at least 95% homologous, preferably at 
least 96%, 97%, 98% or 99% homologous, with the nucleotide sequence shown as SEQ ID 
NO: 48, 50, 52, 54, 56 or 58 in the Sequence Listing. 

SUBSTITUTE SHEET (RULE 26) 
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The term "stringent hybridization conditions" is known in the art from standard protocols 
(e.g. Current Protocols in Molecular Biology, editors R Ausubel et al., John Wiley and 
Sons, Inc. 1994) and could be understood as as stringent or more stringent than those 
5 defined by e.g. hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl 

sulfate (SDS), 1 mM EDTA at +65°C, and washing in 0. lxSSC / 0. 1 % SDS at +68°C. 

Included in the invention is also a nucleic acid molecule which nucleotide sequence is 
degenerate, because of the genetic code, to a nucleic acid of the present invention and more 
particularly to one of the nucleotide sequences set forth as SEQ ID NOs: 48, 50, 52, 54, 56 
and 58. A sequential grouping of three nucleotides, a "codon", codes for one amino acid. 
Since there are 64 possible codons, but only 20 natural amino acids, most amino acids are 
coded for by more than one codon. This natural "degeneracy", or "redundancy", of the 
genetic code is well known in the art. It will thus be appreciated that the nucleic acid 
sequences shown in the Sequence Listing is only an example within a large but definite 
group of nucleic acid sequences which will encode the polypeptide as described above. 

In a further aspect, the invention provides a recombinant polypeptide encoded by a 
nucleotide sequence of the present invention, encoding a GABA B receptor. In preferred 
20 forms of the inventions, the said polypeptide comprises an amino acid sequence set forth as 
SEQ ID NO: 49, 51, 53, 55, 57, 59, 71, 73, 75, 77, 79, 81, 83 or 85 in the Sequence Listing. 
However, polypeptides of the present invention are not to limited to those having an amino 
acid sequence identical with one of SEQ ID NOs: 49, 51, 53, 55, 59, 71, 73, 75, 77, 79, 81, 
83 or 85 in the Sequence Listing. Rather the invention encompasses polypeptides carrying 
25 modifications such as substitutions, small deletions, insertions or inversions, which 
polypeptides nevertheless have substantially the biological activities of the GABAg 
receptor. Included in the invention are consequently polypeptides, the amino acid sequence 
of which is at least 95% homologous, preferably at least 96%, 97%, 98% or 99% 
homologous, with one of the amino acid sequences described by SEQ ID NOs: 49, 5 1 , 53, 
30 55, 57 59, 71, 73, 75, 77, 79, 81, 83 and 85 in the Sequence Listing. 
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Included in the invention are polypeptides of the present invention which have been post- 
translationally modified, e.g. by cleavage of an N-terminal signal sequence which can be 
e.g. 1 to 25 amino acids long. 

In yet another aspect, the invention provides a vector transformed with a nucleic acid 
molecule of the present invention. The said vector can e.g. be a replicable expression 
vector which carries and is capable of mediating the expression of a nucleic acid molecule 
according to the invention. In the present context the term "replicable" means that the 
vector is able to replicate in a given type of host cell into which is has been introduced. 
3 Examples of vectors are viruses such as bacteriophages, cosmids, plasmids and other 
recombination vectors. Nucleic acid molecules are inserted into vector genomes by 
methods well known in the art. 

Included in the invention is also a cultured host cell harbouring a vector according to the 
5 invention. Such a host cell can be a prokaryotic cell, a unicellular eukaryotic cell or a cell 
derived from a multicellular organism. The host cell can thus e.g. be a bacterial cell such as 
an E. coli cell; a cell from a yeast such as Saccharomyces cervisiae or Pichia pastoris, or a 
mammalian cell. The methods employed to effect introduction of the vector into the host 
cell are standard methods well-known to a person familiar with recombinant DNA 
20 methods. 

A further aspect of the invention is a process for production of a GABA B receptor 
polypeptide according to the invention, said process comprising culturing a host cell as 
defined above under conditions whereby the said polypeptide is produced, and recovering 
25 the said polypeptide. 

A further important aspect of the invention is a method for the screening of compounds 
which are inhibitors of transient lower esophageal sphincter relaxations (TLESR), said 
method comprising the use of a nucleic acid molecule encoding a GAB A B receptor. The 
said nucleic acid molecule encoding a GABA B receptor can e.g. be one of the nucleic acid 
molecules according to the invention encoding human or canine GAB A B receptors. 
However, it should be understood that this aspect of the invention is not limited to the use 
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of the said human and canine GAB A B receptors, but rather encompasses the use of any 
GAB A B receptor for screening for compounds which are inhibitors of transient lower 
esophageal sphincter relaxations. 

5 In yet another important aspect, the invention provides a method for the screening of 
compounds which are agonists or antagonists to a GABAg receptor, said method 
comprising the use of a nucleic acid molecule, according to the invention, encoding human 
or canine GAB Aq receptors. 

10 Brief Description of the Drawings 

Figure 1 : Map of the human GAB A B receptor gene. 

The exon/intron organisation is shown. Exons are indicated as solid boxes numbered 1-23. 
15 The part of intron 5 that is retained together with exon 6 giving rise to GABAb receptor lb 
is indicated as an open box. 



Figure 2: Expression of human GABAb receptor lb isoform in transfected C127 cells. 

20 

Western blot analysis of transfected CI 27 cells using a polyclonal anti-human GABAb 
receptor antibody. Lane 1: Untransfected C127 whole cell lysate. Lanes 2-7: Whole cell 
lysates of six independent clones transfected with human GABAb receptor lb isoform 
encoding cDNA. The clones analysed in lanes 4 to 7 express a GABAb receptor of 
25 expected molecular weight (arrow). 



Figure 3: Expression of human GABAb receptor Id isoform in transfected C127 cells. 

30 Western blot analysis of transfected CI 27 cells using a polyclonal an ti -human GABAb 
receptor antibody. Lanes 1-3: Concentrated culture media from three independent C127 
clones transfected with a cDNA expression construct encoding the human GABAb receptor 
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Id isoform. Lanes 4-6: Whole cell lysates corresponding to the clones analysed in lanes 1- 
3. the experiment revealed that the human GABA B receptor Id cDNA encodes a secreted 
isoform. The arrow indicates the bands corresponding to the Id isoform. 



Figure 4: Expression of human GABA B receptor Id isoform in E. coli. 



Western blot analysis of transformed E. coli cells using a polyclonal anti-human GAB At, 
receptor antibody. Lane 1: Lysate from an uninduced£. coli culture transformed with an 

10 pET-based expression construct encoding the human GABA B receptor Id cDNA. Lane 2: 
Lysate from an IPTG-inducedE. coli culture transformed with an expression construct 
encoding the human GABA B receptor Id cDNA. Lane 3: Lysate from an IPTG-induced 
E. coli culture transformed with an expression construct encoding an unrelated protein. 
Lane 4: The BSA-conjugated peptide previously used for immunization was loaded on the 

15 gel as a positive antibody control. 



The screening methods according to the invention can e.g. comprise the steps (a) 
transforming a cultured cell with a nucleic acid molecule encoding a GABA B receptor, so 
20 that a GAB A B receptor is expressed on the surface of the cell; (b) contacting a test 

compound with the said cell; and (c) determining whether the test compound binds to, 
and/or activates, the GABAb receptor. 

In particular GABAb receptor expressing cells, transgenic animals or cells and tissues 
25 derived thereof, nay be used to screen substance libraries for antagonist or agonist 
activities. For this purpose, GABAb receptor expression may be directed to cells'and 
tissues containing, either naturally or artificially, the necessary components allowing 
correct receptor transport and processing as well as coupling to second messenger 
pathways. Screening may be performed as ligand binding assays or functional assays. For 
30 screening, cells and tissues my be prepared in various ways, each uniquely suited to its 
purpose. Ligand binding assays are performed in vivo or in vitro using e.g. radiolabelled 
GAB A. Functional assays examplified by, but not limited to, Ca^-responses, c AMP- 
SUBSTITUTE SHEET (RULE 26) 
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responses and effects on K* channels, may be performed in living cells, broken cells, 
isolated cell membranes, as well as in tissues and in living animals. To facilitate 
measurement of physiological GABAb receptor mediated responses, GAB Ab receptors 
may be co-expressed with promiscous G-proteins like e.g. Gal 6 or Gqi5 increasing G- 
5 protein coupling. Another possible way to increase G-protein coupling is to fuse the 

GABAb receptor with appropriate G-proteins using standard molecular techniques. In order 
to further improve readouts in Ca^-response assays, GABAb receptors may be co- 
expressed with aequorin, a photoprotein cloned from the luminescent jellyfish Aequorea 
victoria. 

10 

In yet another aspect, the invention provides a pharmaceutical compositions comprising a 
soluble GABAb receptor further comprising one or more pharmaceutical acceptable 
carriers and/or diluents. 

1 5 In yet another aspect, the invention provides a method for the diagnosis or treatment of 
conditions related to GABA-dysfunction, e.g. epilepsy, psychiatric disorders such as 
depression and anxiety, cognitive dysfunction, gastroesophageal reflux disease, emesis, 
irritable bowel syndrome, dyspepsia, spasticity, arthritis, allergies, autoimmune diseases, 
neoplastic diseases, pain, and infectious diseases, comprising the use or administration of 

20 soluble forms of the GABA B receptor, such as the human GABAb receptor 1c or Id or 
functionally equivalent modified forms thereof; preferably being a polypeptidfe of the 
present invention.. 

The soluble forms of the GABAb receptor can e.g. be produced by culturing a host cell 
25 harbouring a vector comprising a nucleic acid encoding the soluble GABAb receptor under 
conditions where whereby the said polypeptide is produced, the polypeptide recovered and 
administered to a patient in need thereof. 

30 Furthermore, the level of soluble GABA B receptors in certain body fluids, e.g. serum and 
CSF, can be measured and used in diagnostics related to conditions with altered levels of 
soluble GABAb receptors in said body fluids. 
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Throughout this description the terms "standard protocols" and "standard procedures", 
when used in the context of molecular cloning techniques, are to be understood as protocols 
and procedures found in an ordinary laboratory manual such as: Current Protocols in 
Molecular Biology, editors F. Ausubel et al„ John Wiley and Sons, Inc. 1994, or Sambrook, 
J., Fritsch, E.R and Maniatis, T., Molecular Cloning: A laboratory manual, 2nd Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 1989. 



EXAMPLES 

EXAMPLE 1 A: Cloning and sequencing of cDNA encoding human GABAg receptor la 
and lb 



Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
15 USA) (catalog #6578-1). First-strand cDNA synthesis reactions were performed using the 
First-strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). 
The pd(N) 6 primer was used to prime the first-strand synthesis. The generated cDN A 
molecules were used as template in the PCR reactions described below. 

20 Specific PCR primers were designed (Table 1) based on the sequences of the rat GABAb 
receptor la and lb cDNA (Kaupmann et al. 1997, EMBL accession numbers Y10369 (SEQ 
ID NO: 44) and Y10370 (SEQ ID NO: 46)). 
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TABLE 1 

Primers used for RT-PCR on mRNA from human hippocampus 



Nr. 


Species 


Sequence 5 * - 3 ' 


SEQ ID NO 


794 


Rat 


GTTTC TT C TC GGAT CCAGC TGTGC CTG 


1 


795 


Rat 


C AGGCAC AGCTGGATC C GAGAAGAAACT 


2 


796 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCC 


3 


797 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGC 


4 


831 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCCATCACAGC 


5 


838 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCTCTTCC 


6 


842 


Rat 


C AGGCACAGCTGGATC CGAGAAGAAACTCTGTCGGAAAGT 


7 


863 


Rat 


GGTCATCCAGCGTTGAGGTGAAGAC 


8 


864 


Rat 


G AAGGTTGC C AG ATTAT ACATC CGC 


9 


865 


Rat 


C C ACGATGATTC GAGC ATCTTGACG 


10 


866 


Rat 


GCCTCTCACTCCCCTCATCTCC 


11 


932 


Human 


GAGTGAAGGAGGCTGGAATTG 


12 



5 cDN A fragments encoding parts of the human GAB A B receptors were amplified directly 
by PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +95°C 
for 1 minute, +50°C for 30 seconds, +72°C for 3 minutes, repeated 44 times and finally 

10 +72°C for 7 minutes. The following primer combinations gave PCR products with 

expected sizes: 838 and 842, 838 and 795, 797 and 865, 864 and 865, 864 and 863 which 
corresponds to the 5 '-end of the GABA B receptor la. Primer pair 932 and 83 1 , 932 and 
796, 794 and 83 1 gave PCR products which correspond to the 3 '-end of both GABA B 
receptor la and lb. The primer combination 839 and 918 gave a PCR product 

15 corresponding to the 5' -end of the GABA B receptor lb. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
20 cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 

primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAg receptor were 

used. 
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Additional PCR primers were designed based on the obtained sequences encoding 
fragments of the human GABA B receptor and additional DNA fragments encoding parts of 
the human GAB A B receptors were amplified by PCR, the PCR products subcloned and 
sequenced as described above. 

EXAMPLE IB : Cloning and sequencing of the 3' -ends of the cDNA encoding human 
GABAb receptors la and lb 

Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
USA) (catalogue #6578-1). First strand cDNA synthesis reactions were performed using 
the First-strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, 
Sweden). The Not I-d(T)i 8 primer was used to prime the first-strand synthesis. The 
generated cDNA molecules were used as template in the PCR reactions described below. 

Specific PCR primers were designed (Table 2 ) based on the sequences of the human 
GABAb receptor la and lb cDNA obtained in Example 1 A and the EST sequence EMBL 
accession number Yl 1044. 
20 

By homology searches in the EMBL database using the GABAb receptor cDNA sequences 
obtained in Example 1A as query the EST sequence EMBL accession number Yl 1044 
have been found to be homologous to the 3"-end of the GABAb receptor cDNA. 



25 TABLE 2 

Primers used in PCR to amplify 3 'ends of human GABAb receptor cDNA 



Nr 


Species 


Sequence 5'- 3' 


SEQ ID NO 


938 


Human 


GACGCTTATCGAGCAGCTTC 


13 


972 


Human 


AGCCCAGAACTCACAGGGGGACAT 


14 


973 


Human 


GCTTCAAGCCAGGTACGAACTAA 


15 



cDNA fragments encoding parts of the human GABAb receptors were amplified directly by 
30 PCR using the designed primers with the generated cDNA molecules as templates. All 
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PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene ~ 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: + 95°C 
for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes, repeated 44 times and finally 
+72°C for 7 minutes. The following primer combinations gave PCR products with 
5 expected sizes: 938 and 972, 938 and 973 corresponding to the 3 end of both GABAb 
receptor la and lb cDNA. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
10 sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAb receptor were 
used. 

15 

EXAMPLE 1C : Cloning and sequencing of the 5' -end of the cDNA encoding human 
GABAb receptor lb 

20 Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
USA) (catalogue #6578-1). Marathon cDNA amplification Kit (Clontech) was used for 
performing 573'- RACE (Rapid Amplification of cDNA Ends). Adaptor-ligated double 
stranded cDNA molecules were amplified according to the manufacturers description. The 
pd(N)6 primer from the First-strand cDNA Synthesis kit from Amersham Pharmacia 

25 Biotech (Uppsala, Sweden) was used to produce the adaptor-ligated cDNA. 

A specific PCR primer was designed (Table 3) based on the sequences of the human 
GABAb receptor lb cDNA obtained in Example 1 A. 
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TABLE 3 

Primers used in PCR to amplify the 5 '-ends of human GABA B receptor cDNA 



Nr. 


Species 


Sequence 5 '-3' 


SEQ ID NO 


958 


Human 


TGGCCCTCCACCGCCTCAGTCATCTCA 


16 


API 


Marathon kit 


CCATCCTAATACGACTCACTATAGGGC 


17 



5 cDN A fragments encoding part of the human GAB As receptors were amplified directly by 
PCR using the designed primers with the generated adaptor-ligated cDNA molecules as 
template. The PCR experiment was carried out using the Expand Long Template PCR 
System (Boehringer Mannheim GmbH, Germany) with the following PCR program: + 
94°C for 1 minute, + 94°C for 30 seconds, +60°C for 30 seconds, +68°C for 4 minutes, 

10 repeated 24 times. The primer combination API and 958 gave a PCR product which 
corresponded to the 5 '-end of the GABAb receptor lb cDNA, including 190 base pairs 
upstream the initiation codon. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
15 USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAb receptor were 
20 used. 

EXAMPLE ID : Cloning and sequencing of the 5' -end of the cDNA encoding human 
GABAb receptor la 

25 

Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, USA) 
(catalogue #6578-1). Marathon cDNA amplification Kit (Clontech) was used to obtain 
adaptor-ligated double stranded cDNA molecules according to the manufacturer's 
description. The pd(N) 6 primer from the First-strand cDNA Synthesis kitfrom Amersham 
30 Pharmacia Biotech (Uppsala, Sweden) was used to obtain the adaptor-ligated cDNA . 
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Specific PCR primers were designed (Table 4) based on the sequences of the human 
GABAb receptor la cDNA obtained in Example 1 and the rat GABAb receptor la cDNA 
disclosed in WO 97/46675. 

5 

TABLE 4 

Primers used to amplify 5 '-ends of the human GABAB receptor la cDNA 



Nr. 


Species 


Sequence 5 '-3' 


SEQIDNO 


1033 


Human 


CTCAATCTCATAGTCCACTGG 


18 


1087 


Rat 


CCTTGAGGCCCGGGGAGAG 


19 



10 cDNA fragments encoding part of the human GABAB la receptor were amplified directly 
by PCR using the designed primers with the generated adaptor-ligated cDNA molecules as 
template. The PCR was performed using the Perkin Elmer Taq DNA polymerase with 
Gene Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: 
+94°C for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes; +94°C for 1 minute, +60°C 

15 for 30 seconds,+72°C for 4 minutes, repeated 34 times and finally +72°C for 7 minutes. 

The primer combination 1087 and 1033 gave a PCR product corresponding to the 5 '-end of 
the GABA B receptor la cDNA , including 26 base pairs upstream the initiation codon. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 

25 pGEM-T were used. 

A complete cDNA sequence encoding the human GABAg receptors la (SEQ ID NO: 48) 
and the human GABA B receptor lb (SEQ ID NO: 50) were obtained by aligning the 
sequences of the different fragments cloned and sequenced in Examples 1 A, IB, 1C and 
30 ID. 
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EXAMPLE 2A: Cloning and sequencing of cDNA encoding canine GABAb receptor la 

QuickPrep Micro mRNA Purification kit (Amersham Pharmacia Biotech, Uppsala, 
Sweden) was used to isolate mRNA from canine neural tissues according to the 
manufacturers description. First-strand cDNA synthesis reactions were performed using the 
First-strand cDN A Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). 
The Not-I-d(T)i8 bifunctional or pd(N) 6 primer was used to prime the first-strand synthesis. 
The generated cDNA molecules were used as template in the PCR reactions described 
below. 

Specific PCR primers (Table 5) were designed based on the sequences of the rat GABA B 
receptor la and lb cDNA (Kaupmann et al. 1997, EMBL accession numbers Y 10369 (SEQ 
ID NO: 44 and Y1037O (SEQ ID NO: 46)). 

TABLE 5 

Primers used for RT-PCR on mRNA from canine cortex 



Nr. 


Species 


Sequence 5 '-3' 


SEQ ID NO 


795 


Rat 


CAGGCACAGCTGGATCCGAGAAGAAACT 


20 


831 


Rat 


CGGTCGAC TC ACTTGTAAAGC AAATGTACTCGAC TCCC ATC AC AGC 


21 


838 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCTCTTCC 


22 


840 


Rat 


CGTCAAGATGCTCGAATCATCG 


23 


841 


Rat 


CAGGGGGCTCAGAGGGTCCC 


24 


842 


Rat 


CAGGCACAGCTGGATCCGAGAAGAAACTCTGTCGGAAAGT 


25 


844 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCCATCACAGCTAAG 


26 


848 


Rat 


ACTTTCCGACAGAGTTTCTTCTCGGATCCAGCTGTGCCTG 


27 


865 


Rat 


CCACGATGATTCGAGCATCTTGACG 


28 



cDNA fragments encoding parts of the canine GABA B receptor were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +95°C 
for 1 minute, +50°C for 30 seconds, +72°C for 3 minutes, repeated 44 times and finally 
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+72°C for 7 minutes. The following primer combinations gave PCR products with 
expected sizes: 842 and 838, 838 and 795, 838 and 865 which corresponds to the 5 '-part of 
the canine GABA B receptor cDNA . Primer pair 848 and 844, 848 and 831, 848 and 841, 
840 and 841 gave PCR products which correspond to the 3 "-part of the canine GABAg 
5 receptor cDNA. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
10 cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GAB Ag receptor were 
used. 



15 

EXAMPLE 2B: Cloning and sequencing of the 3'- and 5' -ends of the cDNA encoding 
canine GABA B receptor la 

QuickPrep Micro mRNA Purification kit (Amersham Pharmacia Biotech, Uppsala, 
20 Sweden) was used to isolate mRNA from canine nerve tissues according to the 

manufactures description. Marathon cDNA amplification Kit (Clontech, Palo Alto, CA, 
USA) were used for performing both 5 '-and 3 '-RACE. Two adaptor-ligated double 
stranded cDNA libraries were amplified according to the manufacturers description. A 
random primer (pd(N) 6 ) was used when amplifying the adaptor-ligated cDNA for the 5'- 
25 RACE and the Marathon cDNA Synthesis primer (52-mer) was used when amplifying the 
adaptor ligated cDNA for the 3* -RACE. 

Specific PCR primers were designed (Table 6) based on the sequence of canine GABAb 
receptor 1 a cDN A obtained in Example 2A. 

30 
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TABLE 6 

Primers used in PCR to amplify the 5'- and 3'-ends of canine GABAb receptor la cDNA 



Nr. 


Species 


Sequence 5 '-3' 


SEQ ID NO 


936 


canine 


CTACCGCGCAATGAACTCCTCGTC 


29 


1076 


canine 


CGAGGTGGCGTTGGGGGTCTGTGC 


30 


API 


Marathon kit 


CCATCCTAATACGACTCACTATAGGGC 


31 


AP2 


Marathon kit 


ACTCACTATAGGGCTCGAGCGGC 


32 



5 cDNA fragments encoding parts of the canine GABAb receptor were amplified by PCR 
from the adaptor-ligated cDNA using the designed primers. A number of different PCR 
programs were tested to find conditions where PCR products corresponding to GABAb 
receptor DNA were obtained. The 5'- PCR experiments were carried out using the Expand 
Long Template PCR System (Boehringer Mannheim GmbH, Germany) with the following 

10 PCR program: +94°C for 30 seconds, +72°C for 3 minutes, repeated 4 times; +94°C for 30 
seconds, +70°C for 3 minutes, repeated 4 times; and +94°C for 30 seconds, +68°C for 
3minutes, repeated 24 times. The primer combination AP2 and 1076 gave a PCR product 
which corresponded to the 5 '-end of the GABAb receptor cDNA, including 1 14 base pairs 
upstream the initiation codon. 

15 

The 3'- PCR experiments were carried out using the Expand Long Template PCR System 
(Boehringer Mannheim GmbH, Germany) with the following PCR program: +94°C for 1 
minute; and +94°C for 30 seconds, +60°C for 30 seconds,+68 °C for 4 minutes, repeated 
29 times. The primer combination API and 936 gave a PCR fragment which corresponded 
20 to the 3 '-end of the GAB A B receptor cDNA, including the poly (A) tail. 



The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
25 cycle sequencing pre-mix kit(Amersham Pharmacia Biotech, Uppsala, Sweden). As 

primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to GABAb receptor DNA were used* 
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Complete cDNA sequence encoding the canine GABAb receptor la (SEQ ID NO: 52) was 

obtained by aligning the sequences of the different fragments obtained in Example 2A and 
Example 2B. 

5 

EXAMPLE 3 A: Cloning of cDNA encoding human GAB As receptor 1c and Id from 
Jurkat cells 

A guanidinisothiocyanate/CsCl method was used to isolate total RNA from Jurkat cells. 
10 The first-strand cDNA synthesis was performed using the First-strand cDNA Synthesis kit 
from Amersham Pharmacia Biotech (Uppsala, Sweden). The pd(N)s primer was used to 
prime the first strand synthesis. The generated cDNA molecules were used as templates in 
the PCR reaction described below. 

15 Specific PCR primers (Table 7) were designed based on the sequence of human GAB As 
receptor la and lb cDNA (Example 1), rat GABAb receptor (Kaupmann et al. 1997) and 
the EST sequence EMBL accession number Yl 1044. 

TABLE 7 

20 Primers used in RT-PCR on mRNA from Jurkat cells 



Nr 


Species 


Sequence 5*-3' 


SEQ ID NO 


938 


human 


GACGCTTATCGAGCAGCTTC 


33 


972 


human 


AGCCCAGAACTCACAGGGGGACAT 


34 


973 


human 


GCTTCAAGCCAGGTACGAACTAA 


35 


893 


rat 


GGAGCACCCCCAAGCCCCACTG 


36 


937 


human 


CTGGTTCCTCCCAATGTG 


37 


1005 


rat 


CCTCTCACTCCCCTCATCTC 


38 


1030 


human 


AAGCCAACCTTCCCTGCTTCTC 


39 



cDNA fragments encoding parts of the GABAb receptor were amplified directly by PCR 
using human and rat specific primers. All PCR experiments were carried out using the 
25 Perkin Elmer Taq DNA polymerase with Gene Amp (Roche Molecular Systems Inc., NJ, 
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USA) with the following PCR program: +95°C for 1 minute; +54°C for 1 minutes, +72°C 
for 3 minutes, repeated 44 times; and finally +72°C for 7 minutes. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to GAB As receptor DNA were used. 



The following primer combination gave PCR products corresponding to the 3 '-end of the 
GABAb receptor cDNA: primer pairs 938 and 972; 938 and 973. Unexpectedly both these 
fragments lacked 149 base pairs, resulting in a frame shift and the insertion of a new 
termination codon. The following primer combination gave a PCR product corresponding 
15 to the 5 '-part of the GABAb receptor la cDNA: 893 and 937. The primer pairs 1005 and 
937, 1030 and 937 gave PCR products corresponding to the 5 '-part of the GABAb receptor 
lb cDNA. Also these PCR fragments lacked the same 149 base pairs which resulted in a 
frame shift and the insertion of a new termination codon. 



20 These results show that Jurkat cells contain mRNA encoding two new forms of the human 
GABAb receptor. These are designated GABAb receptor lc (SEQ ID NO: 54 and 55) 
(comprising the same 5' -part as the GABAb receptor la) and GABAb receptor Id (SEQ ID 
NO: 56 and 57) (comprising the same S'-part as the GABAb receptor lb). These two forms 
of the GABAb receptor do not contain any of the transmembrane regions and are therefore 

25 expected to be soluble forms of the receptor. 



EXAMPLE 3B: Analysis of cDNA encoding human GABAb receptors from hippocampus 

30 Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, USA) 
(catalogue #6578-1). First strand cDNA synthesis reactions were performed using the First- 
strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). The 
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pd(N) 6 primer was used to prime the first-strand synthesis. The generated cDNA molecules "~ 
were used as template in the PCR reactions described below. 



Specific PCR primer was designed (Table 8) based on the sequences of the cDN As 
5 encoding human GABAb receptors la and lb. 

TABLES 

Primers used for RT-PCR on mRNA from human hippocampus 



Nr 


Species 


Sequence 5' - 3* 


SEQ ID NO 


937 


Human 


CTGGTTCCTCCCAATGTG 


40 


938 


Human 


GACGCTTATCGAGCAGCTTC 


41 



10 

cDNA fragments encoding parts of the human GABAb receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +94°C 
15 for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes; +94°C for 1 minute, +54°C for 30 
seconds, +72°C for 3 minutes repeated 44 times; and finally +72°C for 7 minutes. The 
primer combination 938 and 937 gave a PCR product which corresponded to the expected 
size of the GABAb receptor la and lb cDNA but also a fragment a smaller size. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 

25 pGEM-T or primers complementary to the cDNA encoding the GABAb receptor were 
used. 

The larger PCR fragment was found to correspond to the 3' -part of the GABAb receptor la 
and lb cDNA and the smaller fragment which lacked 149 base pairs was found to 
30 correspond to the 3' -part of the GABAb receptors lc and Id cDNA identified in Example 
3A. 
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EXAMPLE 4: Cloning and sequencing of cDNA encoding canine GABAb receptor lb 

5 cDNA encoding the canine GAB A3 receptor lb is isolated in a similar manner as 
described in Example 2 for receptor la. PCR primers specifically designed to be 
complementary to the 5' -end of the cDNA encoding the rat and human GABAb receptor 
lb, together with PCR primers complementary to the 3' -end of the cDNA encoding the 
canine GABA B receptor la, and mRNA prepared from a suitable canine tissue, are used. 

10 

EXAMPLE 5: Cloning of cDNA encoding canine GABAb receptor lc 

Total RNA from canine liver was prepared using the RNeasy Total RNA Purification 
1 5 Protocols (Quiagen GmbH, Germany). The first-strand cDNA synthesis was performed 
using the First-strand cDNA Synthesis kit from (Amersham Pharmacia Biotech, Uppsala, 
Sweden). The pd(N) 6 primer was used to prime the first -strand synthesis. The generated 
cDNA molecules were used as templates in the PCR reaction described below. 

20 Specific PCR primers (Table 9) were designed based on the sequence of canine GABAb 
receptor lacDNA. 

TABLE 9 

Primers used in RT-PCR 

25 



Nr 


Species 


Sequence 5 '-3* 


sectid no 


936 


canine 


CTACCGCGCAATGAACTCCTCGTC 


42 


954 


canine 


CCTTCTTCTCCTCCTTCTTAGTGA 


43 



cDNA fragments encoding parts of the canine GABAb receptor were amplified directly by 
PCR using canine specific primers. All PCR experiments were carried out using the Perkin 
Elmer Taq DNA polymerase with Gene Amp (Roche Molecular Systems Inc., NJ, USA) 
30 with the following PCR program: +95°C for 1 minute, +54°C for 30 seconds, +72°C for 3 
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minutes, repeated 44 times and finally +72°C for 7 minutes. The primer combination gave 
a PCR product with a size corresponding to the GABA B receptor la but also a fragment of 
smaller size indicating the presence of a GABAb receptor lc. 

5 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
10 pGEM-T were used. 

The smaller fragment was shown to lack 149 base pairs. This deletion caused a frame shift 
and an insertion of a new termination codon, verifying the existence of a canine GAB A© 
receptor lc. 

15 

Complete cDNA sequence encoding the canine GABAg receptor lc (SEQ ID NO: 58) was 
obtained by aligning the sequences of the fragments obtained in Example 2A, Example 2B 
and Example 5. 

20 

EXAMPLE 6. Cloning, sequencing and organization of human GABAb receptor genomic 
fragments 

To determine the structural organization and sequence of the human GABAb receptor gene, 
25 human genomic DNA libraries and human genomic DNA were screened and analyzed. 
Human genomic libraries were obtained from Clontech (Palo Alto, CA, USA). The 
libraries were constructed from female leukocyte DNA (catalog # HL1 1 1 1 J), cloned into 
AEMBL-3 vector. The average size of inserts are 16 kb and the number of independent 

clones are 1.7xl0 6 . Human genomic DNA was obtained from Clontech (catalog # 6550-1). 
30 In order to isolate recombinant phages containing exon and intron sequences of the human 
GABAb receptor gene, 48 individual bacterial plates with a diameter of 150 mm and 
approximately 4 x 10 4 individual plaques per plate, were screened. The methods and 
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solutions used were as described in the Library Protocol Handbook: General Procedures for 
the Hybridization of Lambda Phage Libraries w/DNA Probes (Clontech) with some modi- 
fications as will be apparent from the following. 

5 The experiment was carried out essentially as follows. The numbers are given per plate 
basis. A sample of the phage library diluted in 0. 1 ml sterile lambda diluent was prepared 
in order ttPobtain an estimated titer of 40,000 pfu (plaque forming units). A 0.6 ml LB- 
medium culture of the E, coli host strain K802 (obtained from Clontech) was infected with 
40000 pfu recombinant phages for 15 minutes at +37°C. The culture was then mixed with 

10 7 ml top agarose (6.5 g of agarose added per liter LB) and poured onto LB plates. The 

plates were incubated at +37°C for approximately 7 hours. The plates were then chilled at 
+4°C. 

Plaque hybridization experiments were as follows. Membrane filters, Colony/Plaque 
15 Screen (DuPont, Wilmington, DE, USA), were placed onto the top of the plates for 3 

minutes. For denaturation of DNA the filters were removed and floated in 0.5 M NaOH on 
a plastic wrap for 2 minutes, with the plaque side up. This step was repeated once to ensure 
efficient denaturation. Following neutralization the membrane filters were placed in 1M 
Tris-HCl pH 7.5, two times 2 minutes and allowed to dry. 

20 

To obtain probes for DNA hybridization screening of the membrane filters, a GABAb 
receptor cDNA clone was digested with SacII and a 479 bp fragment (base pairs 573-1051 
in the cDNA encoding human GABA B receptor la, SEQ ID NO: 48) was separated. by 
agarose electrophoresis, excised and transferred to a polypropylene microcentrifuge tube. 

25 Additional probes were obtained by PCR amplification of various regions of the GAB Ab 
receptor cDNA (base pairs 68-486 and 2368-2863 in the cDNA encoding human GABAb 
receptor la, SEQ ID NO: 48). The isolated cDNA fragment was 32 P-labeled using 
Megaprime DNA labeling system (Amersham Pharmacia Biotech, Uppsala, Sweden) by 
the following procedure. Water was added at a ratio of 3 ml per gram of gel, and placed in 

30 a boiling water bath for 7 minutes to melt the gel and denature the DNA. A volume of 
DNA/agarose solution containing 25 ng of DNA was added to the labeling reaction, 
according to the supplier's instructions. Labeled nucleotides were removed from DNA 
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labeling reactions using MicroSpin™ G-50 Columns (Amersham Pharmacia Biotech, ~ 
Uppsala, Sweden ). 

The DNA hybridization reaction was performed under stringent conditions according to the 
5 method described below. The filter membranes were prehybridized at +65°C for at least 1 
hour in a solution composed of 1% SDS, 1M NaCl, and 10% dextran sulfate using a 
hybridization oven (Hybaid Ltd, Ashford, UK). Following prehybridization a solution 
containing denatured herring sperm DNA of a final concentration of 100 ng/ml and the 
32p-iabeled DNA probe at a concentration <10 ng/ml (for optimal signal to background 

10 ratio) was added to the prehybridization solution and the membrane filters were incubated 
at +65 °C for 10-20 hours. Following the removal of the hybridization solution the 
membrane filters were first washed in a 2xSSC (0.3M NaCl, 0.03M Na-citrate), 1% SDS 
solution two times for 5 minutes at room temperature. In the next step, the membrane 
filters were incubated +60°C two times for 30 minutes each in the same solution. In a third 

15 step, the filters were washed two times at room temperature in O.lxSSC. Finally, the 

membrane filters were placed on a sheet of filter paper with the DNA face up, and allowed 
to dry. The dried membrane filters were then exposed to X-ray films and autoradiographed. 

Of the approximately 2x10^ individual plaques analyzed, four hybridizing plaques were 
20 detected and isolated. These three isolates were designated #GR1, #GR12, #GR13 and 

#GR4h respectively. After several rescreening experiments, the recombinant phage DNA 
was purified using Qiagen Lambda Midi Kit (Qiagen GmbH, Germany). The purified DNA 
was digested with Sail and the fragments representing the inserts were isolated by agarose 
electrophoresis. 

25 

The sizes of the inserts were approximately isolate #GR1, 12 kb, isolate #GR12, 12 kb; 
isolate #GR13, 16kb; and isolate #GR41, 19 kb. These fragments were cloned into Sail 
digested linearized pUC19, resulting in the plasmids pAM362 (isolate #GR1), pAM363 
(isolate #GR12), pAM364 (isolate #GR13)and pAM365 (isolate #GR41). The inserts from 
30 the four plaques which hybridizes to the GAB Ab receptor cDNA probes were analyzed by 
PCR, restriction mapping and hybridization to 32p_i a t> e i e d DNA fragments representing 
various regions of the GAB Ab receptor gene. 
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The cloned fragments in the plasmids pAM362, pAM363, pAM364 and pAM365 were 
characterized by restriction enzyme mapping, using EcoRI, Hindin, PstI, and BamHI. The 
approximate positions of the exons and the approximate size of the introns were analyzed 
5 and determined by PCR-based exon-exon linking and agarose gel electrophoresis. 

In order to facilitate nucleotide sequence analysis, 7 restriction sub-fragments derived from 
pAM364, 2 restriction fragments derived from pAM362, and 1 restriction sub-fragment 
derived from pAM365 were isolated and cloned into pUC19, resulting in the plasmids 
10 pAM366-pAM375. The following strategy was employed; by combining PCR primers 

located within the pUC19 sequence either upstream or downstream of the cloning site, with 
a PCR primer with defined orientation and specific for the GAB Ab receptor derived 
subcloned fragment allowed the determination. 

15 The inserts in the 10 plasmids pAM366-pAM375 were subjected to nucleotide sequence 
analysis. The nucleotide sequences for all subclones were determined using a Thermo 
Sequenase dye terminator cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, 
Uppsala, Sweden). As primers for sequencing reactions specific oligonucleotides comple- 
mentary to pUC19 or primers complementary to the GABAb receptor cDNA were used. 

20 

The genomic fragments cloned in the plasmids pAM362-pAM365 where shown to contain 
the complete transcribed part of the human GABAb receptor gene and extends > 3kb 
upstream of the first exon and > 2kb downstream of the last exon. The fragments cloned in 
the plasmid pAM362 were shown to contain exons 7-1 1, pAM363 exons 12-22, pAM364 
25 exons 1-11, and pAM365 exons 12-23 of the GABAb receptor gene (Figure 1). The 

sequence of exons 1-11 and introns 1-10 is listed in SEQ ID NO: 60 and the sequence of 
exons 12-23 and introns 12-22 is listed in SEQ ID NO: 61. 

The human GABAb receptor gene consists of 23 exons and 22 introns (Figure 1). The size 
30 of the exons range between 21 bp and 1486 bp. As can be noted from Table 10 exon/intron 
boundaries are in accordance with the AG/GT rule and conform well to the consensus 
sequence suggested by Mount et al. 1982. 
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TABLE 10 

Exon-Intron boundaries of the GABAb receptor gene, sequences at exon-intron junctions. 







5 'splice donor 


3 'splice acceptor 




Exon 1— Eson 2 


CGAG 


GTAAGAG 


CCGCCTCTCACTTAG 


ATGT 


Exon 2 -Exon 3 


GAAG 


GTGCATC...- 


- CGACTC AC CCCTTAG 


GTTG 


Exon 3 -Exon 4 


TGTG 


GTGAGTA 


CCWATCTCTCCACAG 


TCCG 


Exon 4 -Exon 5 


CAGG 


GTGAGGG 


CTTTC CTGCTGCC AG 


TOAA 


Vvfln *> — Bvnn £ 

rlnWII ~J BAMU V 


TCAG 


GTGAGAT 


cnr* a ^ , ^ , ^ , ^ , 'Tv^^^■•/• , t\ r- 


AACG 


Exon 6 -Exon 7 


CAAG 




CCTCTTGTCTTTCAG 


TGTG 


Exon 7 -Exon 8 


TGTG 


GTAAGCA 


CTCCCTGCCCCACAG 


CTTT 


Exon 8 -Exon 9 


TTCG 


GTGAGGA 


TTATTC CC AC CCAAG 


ACTC 


Exon 9- Exonl 0 


GAAG 


GTCAGAT „ „ 


. .CTTTCTCTGTKGTAG 


CGCC 


Exonl 0 -Exonl 1 


TGAG 


GTGGART 


CTCCTCTGTATTCAG 




Bxonll-Exonl2 


CATG 


G TGAG AG 


TTTTTTCCTCCCAAG 


ACAT 


Exonl2 -Exonl 3 


CTCT 


GTGAGTT 


— .TGTTCCTTCCCTCAG 


GGCC 


Exonl3 -Exonl4 


CAGG 


GTTAGTA 


— TTGTCGTCTGCCCAG 


GTGG 


Exonl 4 -Exonl 5 


ATTG 


GTGAGTG 


... CC CTGTGC CATGC AG 


GAGG 


Exonl5-Exonl6 


TCCG 


GTXAfiTT^ 


CCACCTCTGCCCTAG 


TTAT 


Exonl 6 -Exonl 7 


CCAG 


GTGAGGA 


TCTCTTCCTTTCTAG 


GCCC 


Exonl7 -Exonl 8 


GAAG 


GTGAGCT 


CACATATTTATCCAG 


ACTC 


Exonl 8 - Exonl 9 


TGAG 


GTACCAC 


TYGTTTCTGCCCTAG 


ACAT 


Exon 19 - Exon2 0 


CTTG 


GTGTGTG 


CTCCTGCCATCCTAG 


GCAT 


Exon2 0 -Exon2 1 


GGCA 


GTGAGGA. 


TGTCTTTCCCTCTAG 


GTCC 


Exon21-Exon22 


CAAG 


GTAAGGA 


AACATTTGCCCCCAG 


ATGC 


Exon2 2 -Exon2 3 


TGAG 


GTGCGGG. 


TGCTTCTTCCTCCAG 


AAAG 



A comparison of the sequences of the different forms of the human GABAb receptor 
cDNA (SEQ ID NO: 48, 50, 54, and 56) and the sequence of the human GABAb gene 
(SEQ ID NO: 60 and 61) reveals that different mRNA encoding human GABAb receptors 
are formed by alternative splicing. The translational start of the GABAb receptor la is 
localized in exon 2 and the translational stop is localized in exon 23. The mRNA encoding 
GABAb receptor lb is formed by alternative splicing where part of intron 5 is retained 
together with exon 6 where the translational start of the GABAb receptor lb is derived 
from the intron sequence. The mRNA encoding GABAb receptor 1c is formed by 
alternative splicing where exon 15 are removed together with introns 14 and 15 and a 
frame shift and a translational stop are generated in the sequence corresponding to exon 16. 
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The mRNA encoding GABA B receptor Id is formed when the translational start of the 
GABAb receptor lb is generated together with the translational stop of the GABAb 
receptor lc. 

5 The la isoform is formed by splicing exon 5 to a cryptic splice site in the middle of exon 6. 
Transcription of lb isoform mRNA is most likely initiated from regulatory elements in 
intron 5. The ATG initiating translation of lb mRNA is located in the 5' -end of exon 6. 

Additional variants of the mRNA encoding variants of the human GABAb receptor can be 
10 derived by alternative splicing where one ore more of the exons or part of exons are 

excised in the processing of the pre-mRNA. The subsequent transcription of these mRNA 
will give rise to variants of the human GAB A B receptor with potentially different 
biological and/or pharmacological activities. 

15 

EXAMPLE 7. Analysis of cDNA encoding human GABAb receptors from human brain 

Messenger RNA from human fetal brain (catalog #6525-1) and adult human brain (catalog 
#6516-1) were obtained from Clontech (Palo Alto, CA, USA). First strand cDNA synthesis 
20 reactions were performed using the first strand cDNA Synthesis kit from Amersham 
Pharmacia Biotech (Uppsala, Sweden).The pd(N)e primer was used to prime the first- 
strand synthesis. The generated cDNA molecules were used as templates in the PCR 
reactions described below. 

25 Specific PCR primers was designed (Table 1 1 ) based on the sequences of the rat GAB A© 
receptor la and lb cDNA and human GABAb receptor la and lb cDNA 
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Primers used for RT-PCR on mRNA from human fetal brain 
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Nr 


Species 


Sequence 5 '-3' 


SEQ ID NO 


838 


Rat 


ATGCGCGCCCKjCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCT 
CTTCC 


62 


863 


Rat 


GGTCATCCAGCGTTGAGGTGAAGAC 


63 


864 


Rat 


GAAGG1TGCCAGATTATACATCCGC 


64 


865 


Rat 


CCACGATGATTCGAGCATCTTGACG 


65 


937 


Human 


CTGGTTCCTCCCAATGTG 


66 


1015 


Human 


CCAGTGGACTATGAGATTGAG 


67 



5 cDNA fragments encoding parts of the human GAB Ab receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. PCR 
experiments with primers 838, 863 ,864 and 865 were carried out using the Perkin Elmer 
Taq DNA polymerase with Gene Amp (Roche Molecular System Inc., NJ, USA) with the 
following PCR program: +95°C for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes, 

10 repeated 44 times and finally +72°C for 7 minutes. PCR experiments with primers 937 and 
1015 were carried out using the Expand Long Template PCR System (Boehringer 
Mannheim GmbH, Germany) with the following PCR program: +94°C for 2 minutes; 
+94°C for 10 seconds, +55°C for 30 seconds, +68°C for 3 minutes repeated 35 times; and 
finally +68°C for 7 minutes. The primer combinations 838 and 863, 864 and 863 , 864 and 

15 865, 937 and 1015 gave PCR products. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, WI, 
USA). The inserts were subjected to nucleotide sequence analysis. The complete nucleotide 
sequence for all subclones were determined using a ThermoSequenase dye terminator cycle 
20 sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As fof primers 
for sequencing reactions specific oligonucleotides complementary to the vector pGEM-T or 
primers complementary to the cDNA encoding the GABAb receptor were used. 

A number of analyzed clones isolated from fetal brain were found to lack 1 86 base pairs, 
25 corresponding to exon 4. This alternative splicing resulted in a cDNA (SEQ ID NO: 70) 
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encoding a protein (SEQ ID NO: 71) comprising of 899 amino acids and designated ~ 
GABA B receptor le. 

Other clones from fetal brain lacked 368 base pairs, corresponding to exons 4, 5 and 6, 
resulting in a cDN A (SEQ ID NO: 72) where a frame shift and a translational stop codon 
are generated in the sequence corresponding to exon 7, encoding a protein (SEQ ID NO: 
73) comprising only 97 amino acids designated GABAb receptor If. 

One clone lacked 207 base pairs, corresponding to exons 4 and 5, resulting in a cDNA 
(SEQ ED NO: 74) encoding a protein (SEQ ID NO: 75) comprising 892 amino acids 
designated GABAb receptor lg. 

Another clone had two deletions, the first comprising 186 base pairs corresponding to exon 
4, the second comprising 39 base pairs corresponding to part of exon 6, resulting in a 
cDNA (SEQ ID NO: 76) encoding a protein (SEQ ID NO: 77) comprising 886 amino acids 
designated GABA B receptor lh. 



Yet another clone from adult human brain was found to have a long deletion comprising 
1 194 base pairs corresponding to base pairs 319 - 1512 of the cDNA encoding human 
GABAb receptor la. This deletion corresponds to part of exon 4, exons 5-11, and part of 
exon 12. This cDNA (SEQ ID NO: 78) encodes a protein (SEQ ID NO: 79) comprising 
563 amino acids designated GABA B receptor li. 

One clone isolated from fetal brain lacked 284 base pairs corresponding to part of exon 3 
and the whole exon 4, generating a frame shift and a translational stop codon in the 
sequence corresponding to exon 5. This cDNA (SEQ ID NO: 80) encodes a protein (SEQ 
ID NO: 81) comprising only 105 amino acids designated GABAb receptor lj. 



EXAMPLE 8. Analysis of cDNA encoding human GABAb receptors from Jurkat cells 

A guanidinisothiocyanate/CsCl method was used to isolate total RNA from Jurkat cells. 
First strand cDNA synthesis reactions were performed using the first strand cDN A 
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Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). The pd(N)fc primer 
was used to prime the first-strand synthesis. The generated cDNA molecules were used as 
templates in the PCR reactions described below. 

5 Specific PCR primers was designed (Table 12) based on the sequences of the human 
GABAb receptor la and lb cDNA 

TABLE 12 

Primers used for RT-PCR on mRNA from Jurkat cells 

10 



Nr 


Species 


Sequence 5 '-3' 


SEQ ID NO 


937 


Human 


CTGGTTCCTCCCAATGTG 


68 


1015 


Human 


CCAGTGGACTATGAGATTGAG 


69 



cDNA fragments encoding parts of the human GABAb receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. The 
PCR experiment was carried out using the Expand Long Template PCR System 
15 (Boehringer Mannheim GmbH, Germany) with the following PCR program: +94?C for 2 
minutes; +94°C for 10 seconds, +55°C for 30 seconds, +68°C for 3 minutes repeated 35 
times; and finally +68°C for 7 minutes. The primer combination 937 and 1015 gave a PCR 
product. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, WI, 
USA). The inserts were subjected to nucleotide sequence analysis. The complete nucleotide 
sequence for all subclones were determined using a ThermoSequenase dye terminator cycle 
sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As for primers 
for sequencing reactions specific oligonucleotides complementary to the vector pGEM-T or 

25 primers complementary to the cDNA encoding the GABAb receptor were used. 

Two clones had two deletions, the first comprising 368 base pairs corresponding to exons 
4, 5 and 6, the second comprising 151 base pairs corresponding to exon 15 where a frame 
shift and a translational stop codon are generated in the sequence corresponding to exon 7. 
30 This cDNA (SEQ ID NO: 82) encodes a protein (SEQ ID NO: 83) comprising only 98 
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amino acids designated GABA B receptor lk which is identical to the GABA B receptor If 
described above. 

Another two clones also had two deletions, the first comprising 246 base pairs 
5 corresponding to part of exon 4, exon 5 and exon 6, the second comprising 149 base pairs 
corresponding to exon 15, generating a frame shift and a translational stop codon in the 
sequence corresponding to exon 16. This cDNA (SEQ ID NO: 84) encodes a protein (SEQ 
ID NO: 5) comprising 496 amino acids designated GABAb receptor 11. 

10 Additional variants of cDNA encoding the human GABAb receptors can be identified in a 
similar manner using PCR primers based on the sequence of the cDNA and genomic 
fragments encoding the human GABAb receptors disclosed in the present application. 

The biological activity of these variants of the human GABAb receptor can be evaluated by 
15 transfection of suitable host cells with expression vectors comprising the corresponding 
cDNA sequences and subsequent measurement of binding of labeled ligands or activation 
of the receptor or modulation of receptor function. 



20 EXAMPLE 9: Generation of antibodies 

Antibodies were raised in rabbits against four different BS A-conjugated 20 amino acids 
long synthetic peptides corresponding to selected regions of the human and canine GABAb 
receptor extracellular domain. Two were directed against a sequence common for GABAb 
25 receptor la and b (abl and ab2), one against a GABAb receptor la-specific region (al) and 
one against a GABA B receptor lb-specific sequence (bl). To allow BSA-conjugation, a 
cystein residue was added to the N-terminus in all peptides but al, which contains an 
endogenous cystein. 

30 Peptide al : (SEQ ID NO: 49 amino acids 18-37) 

Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He He Jiis Pro Pro Trp 
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Peptide abl : (SEQ ID NO: 49 amino acids 197-216 with N-terminally added Cys) 

Cys - Glu Asp Val Asn Ser Arg Arg Asp Be Leu Pro Asp Tyr Glu Leu Lys Leu lie 

His His 

Peptide ab2: (SEQ ID NO: 49 amino acids 271-290 with N-terminally added Cys) 

Cys - Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
Ser Ala 

Peptide bl : (SEQ ID NO: 57 amino acids 30-47 with N-terminally added Cys) 

Cys - Ser His Ser Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser 

The antibodies were purified from rabbit serum by affinity chromatography using the 
corresponding immobilized peptide and subsequently used to detect expression of 
recombinant GABA B receptor isoforms on Western blots. 



EXAMPLE 10: Heterologous expression of GAB As receptor isoforms in mammalian cells 

A Hindin / Sail cDNA fragment encoding the human GABAb receptor lb isoform was 
cloned into a BPV (bovine papilloma virus)-based expression vector containing the mMT- 
1 (murine metallothioneine) promoter. Using the calcium phosphate method, murine CI 27 
cells were co-transfected with the GABAb receptor expression construct and an expression 
plasmid containing a G418 resistance marker gene using the calcium phosphate method. 
G418 resistant clones were evaluated by Western blot analysis for expression of the 
approximately 100 kDaGABA B receptor lb isoform(Figure 2). Similarly, but using 
Lipofectamine (Life Technologies Inc, Rockville, MD, USA) for transfection, th^human 
GABAb receptor lb isoform was expressed in human HEK-293 cells using the pCI-neo 
expression vector. The identity of the heterologously expressed receptor was verified in 
HEK-293 cells by Western blot analysis and radiologand binding experiments. 

A cDNA fragment encoding the human GABAb receptor Id isoform was^cloned into a 
BPV-based expression vector containing the mMT-1 promotor. Using the calcium 
phosphate method, murine CI 27 cells were co-transfected with the GABAb receptor 
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expression construct and an expression plasmid containing a G418 resistance marker gene. 
G418 resistant clones, and concentrated medium from the same clones, were evaluated for 
GABAb receptor Id isoform expression by Western blot analysis (Figure 3). The 
experiment revealed the human GABA B receptor Id to be a secreted isoform. 



EXAMPLE 1 1 : Heterologous expression of GAB A B receptor isoforms in E. coli 

A cDNA fragment encoding the human GABAb receptor Id isoform was cloned into a 
10 modified pET (Pharmacia Amersham, Uppsala, Sweden) vector downstream of a STII 

(heat stable enterotoxin II of E. coli) signal peptide and followed by a thrombin clevage site 
and a hexahistidine tag. The expression construct was subsequently used to transform the 
BL21 (DE3) E. coli strain. Western blot analysis of IPTG-induced bacteria revealed 
expression of a human GABA B receptor Id isoform of expected size(Figure 4). 



15 



In addition, the human GABAb receptor Id isoform was successfully expressed without 
fusion to a bacterial signal peptide in E. coli strain AD494 (DE3). 



20 EXAMPLE 12: Method for the screening of substances which are GABAb receptor 
antagonists or agonists 

GAB A B receptor expressing cells, transgenic animals or cells and tissues derived thereof, 
are used to screen substance libraries for antagonist or agonist activities. Screening is 
25 performed as ligand binding assays or functional assays. For screening, cells and tissues are 
prepared in various ways, each uniquely suited to its purpose. Ligand binding assays are 
performed in vivo or in vitro. Functional assays exemplified by, but not limited to, Ga**- 
responses, cAMP-responses and effects on Cl~ and K + channels, are performed in living 
cells, broken cells, isolated cell membranes, as well as in tissues and in living animals. 
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1 . A nucleic acid molecule encoding a human or canine GAB Ag receptor, or a 
functionally equivalent modified form thereof. 

2. A nucleic acid molecule according to claim 1 encoding a human GAB Ag receptor, or 
a functionally equivalent modified form thereof. 

3. A nucleic acid molecule according to claim 2 encoding the human GAB Ag receptor 
la, or a functionally equivalent modified form thereof. 

4. A nucleic acid molecule according to claim 3 selected from: 

(a) a nucleic acid molecule comprising a nucleotide sequence set forth as SEQ ID 
NO: 48; 

(b) a nucleic acid molecule comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary to 
the polypeptide coding region of a DNA molecule as defined in (a); and 

(c) a nucleic acid molecule comprising a nucleotide sequence which is degenerate as 
a result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

5. A nucleic acid molecule according to claim 2 encoding the human GABAg receptor 
lb, or a functionally equivalent modified form thereof. 

6. A nucleic acid molecule according to claim 5 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
50; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 
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7. A nucleic acid molecule according to claim 2 encoding the human GAB receptor 
lc, or a functionally equivalent modified form thereof. 

8. A nucleic acid molecule according to claim 7 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
54; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

9. A nucleic acid molecule according to claim 2 encoding the human GABAu receptor 
Id, or a functionally equivalent modified form thereof.. 

10. A nucleic acid molecule according to claim 9 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
56; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

11. A nucleic acid molecule according to claim 1 encoding a canine GAB Ag receptor, or 
a functionally equivalent modified form thereof. 

12. A nucleic acid molecule according to claim 1 1 encoding the canine GABAg receptor 
la, or a functionally equivalent modified form thereof. 

13. A nucleic acid molecule according to claim 12 selected from 
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jo 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: " 

52; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 

5 polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleic acid sequence as defined in (a) or (b). 

14 A nucleic acid molecule according to claim 1 1 encoding the canine GABAg receptor 
10 1 c, or a functionally equivalent modified form thereof. 



15. A nucleic acid molecule according to claim 14 selected from 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
58; 

15 (b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 

under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 
(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleic acid sequence as defined in (a) or (b). 

20 

16. A recombinant polypeptide encoded by a nucleotide sequence according to any one 
of claims 1 to 15. 



17. The polypeptide according to claim 16 comprising an amino acid sequence set forth 
25 as SEQ ID NO: 49, 5 1 , 53, 55, 57 or 59 in the Sequence Listing. 

18. A polypeptide according to claim 16 which has been posttranslationally modified. 

19. A vector transformed with a nucleic acid molecule according to any one of claims 1 
30 to 15. 

20. A cultured host cell harboring a vector according to claim 19. 
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21 . A process for the production of a GABAb receptor, said process comprising 
culturing a host cell according to claim 20 under conditions whereby the said 
polypeptide is produced, and recovering the said polypeptide. 

22. A method for the screening of compounds which are inhibitors of transient lower 
esophageal sphincter relaxations, said method comprising the use of a nucleic acid 
molecule encoding a GABAb receptor 

23. A method according to claim 22 wherein the said nucleic acid molecule encoding a 
GABAb receptor is the nucleic acid molecule according to any one of claims 1 to 15. 

24. A method for the screening of compounds which are agonists or antagonists to a 
GABAb receptor, said method comprising the use of a nucleic acid molecule 
according to any one of claims 1 to 15. 

25. A method according to any one of claims 22 to 24 comprising the steps (a) 
transforming a cultured cell with a nucleic acid molecule encoding a GABAb 
receptor, so that a GABAb receptor is expressed on the surface of the cell; (b) 
contacting a test compound with the said cell; and (c) determining whether the test 
compound binds to, and/or activate, the GABAb receptor. 

26. A pharmaceutical compositions comprising a soluble GAB A B receptor further 
comprising one or more pharamceutical acceptable carriers and/or diluents. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ASTRA AB 

10 (B) STREET: Vastra Malarehamnen 9 

(C) CITY: Sodertalje 

{ E ) COUNTRY : Sweden 

(F) POSTAL CODE (ZIP): S-151 85 

(G) TELEPHONE: +46-8-553 260 00 
15 (H) TELEFAX: +46-8-553 288 20 

(I) TELEX: 19237 astra s 

(ii) TITLE OF INVENTION: New nucleotide sequences 
20 (iii) NUMBER OF SEQUENCES: 85 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

CD) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
35 ( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



GTTTCTTCTC GGATCCAGCT GTGCCTG 27 



H1865-1 WO SEQ 
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( 2 ) INFORMATION FOR SEQ ID NO: 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingle 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

( 

CAGGCACAGC TGGATCCGAG AAGAAACT 

(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCC 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



H1865-1 WO SEQ 2 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 



ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG C 41 



5 (2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCCAT CACAGC 46 



20 (2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 50 



35 ( 2 ) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
40 ( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAGGCACAGC TGGATCCGAG AAGAAACTCT GTCGGAAAGT 

(2) INFORMATION FOR SEQ ID NO : 8: 

.(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
GGTCATCCAG CGTTGAGGTG AAGAC 

(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGGTTGCC AGATTATACA TCCGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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5 



<ii> MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCACGATGAT TCGAGCATCT TGACG 25 

(2) INFORMATION FOR SEQ ID NO: 11: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

( B ) TYPE : nucleic acid 

( C ) STRANDEDNESS : s ingle 
15 { D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



20 



25 



35 



GCCTCTCACT CCCCTCATCT CC 22 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GAGTGAAGGA GGCTGGAATT G 21 



(2) INFORMATION FOR SEQ ID NO: 13: 

40 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACGCTTATC GAGCAGCTTC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGCCCAGAAC TCACAGGGGG ACAT 



(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCTTCAAGCC AGGTACGAAC TAA 

(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: — 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : s ingl e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGGCCCTCCA CCGCCTCAGT CATCTCA 27 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

25 

CCATCCTAAT ACGACTCACT ATAGGGC 27 



(2) INFORMATION FOR SEQ ID NO : 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

40 

CTCAATCTCA TAGTCCACTG G 21 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCTTGAGGCC CGGGGAGAG 



PCT/SE98/01947 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAGGCACAGC TGGATCCGAG AAGAAACT 

(2) INFORMATION FOR SEQ ID NO: 21: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCCAT CACAGC 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
15 ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 50 

(2) INFORMATION FOR SEQ ID NO : 23: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
30 CGTCAAGATG CTCGAATCAT CG 22 

(2) INFORMATION FOR SEQ ID NO: 24: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 
{ D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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CAGGGGGCTC AGAGGGTCCC 



20 



5 



(2) INFORMATION FOR SEQ ID NO : 25: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 



10 



(C) STRAND EDNESS : single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
15 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO : 25: 

C AGGC AC AG C TGGATCCGAG AAGAAACTCT GTCGGAAAGT 

20 (2) INFORMATION FOR SEQ ID NO: 26: 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCCAT CACAGCTAAG 

35 (2) INFORMATION FOR SEQ ID NO: 27: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 



25 



(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ACTTTCCGAC AGAGTTTCTT CTCGGATCCA GCTGTGCCTG 40 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCACGATGAT TCGAGCATCT TGACG . 25 

20 

(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 24 base pairs 

( B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTACCGCGCA ATGAACTCCT CGTC 24 

35 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 base pairs u 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGAGGTGGCG TTGGGGGTCT GTGC 

(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 
CCATCCTAAT ACGACTCACT ATAGGGC 

(2) INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACTCACTATA GGGCTCGAGC GGC 

(2) INFORMATION FOR SEQ ID NO : 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single — 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GACGCTTATC GAGCAGCTTC 20 

(2) INFORMATION FOR SEQ ID NO : 34: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
15 (B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AGCCCAGAAC TCACAGGGGG ACAT 24 

(2) INFORMATION FOR SEQ ID NO : 35: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
30 (B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
"GCTTCAAGCC AGGTACGAAC TAA 23 

(2) INFORMATION FOR SEQ ID NO: 36: 
( i ) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 36: 
GGAGCACCCC CAAGCCCCAC TG 

(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTGGTTCCTC CCAATGTG 

(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCTCTCACTC CCCTCATCTC 

( 2 ) INFORMATION FOR SEQ ID NO : 3 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AAGCCAACCT TCCCTGCTTC TC 22 

15 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE:, nucleic acid 

20 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CTGGTTCCTC CCAATGTG 18 

30 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GACGCTTATC GAGCAGCTTC * 20 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CTACCGCGCA ATGAACTCCT CGTC 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCTTCTTCTC CTCCTTCTTA GTGA 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norwegicus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 1 . .2883 



10 (x) PUBLICATION INFORMATION: 

( A ) AUTHORS : Kaupmann , K 

Huggel, K 
Heid, J 
Flor, P 

15 Bischof f , S 

Mickel, S 

McMaster, G 

Angst, C 

Bittiger, H 
20 Froestl, W 

(B) TITLE: Expression cloning of GABA-B receptors 

uncovers similarity to metabotropic glutamate 
receptors 

(C) JOURNAL: Nature 
25 (D) VOLUME: 386 

(F) PAGES: 239-246 

(G) DATE: 20 march- 19 97 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATG CTG CTG CTG CTG CTG GTG CCT CTC TTC CTC CGC CCC CTG GGC GCT 48 

Met Leu Leu Leu Leu Leu Val Pro Leu Phe Leu Arg Pro Leu Gly Ala 
15 10 15 



35 GGC GGG GCG CAG ACC CCC AAC GCC ACC TCG GAA GGT TGC CAG ATT ATA^ 96 
Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie lie 
20 25 30 

CAT CCG CCC TGG GAA GGT GGC ATC AGG TAC CGT GGC TTG ACT CGC GAC 144 
40 His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg Asp 
35 40 45 

CAG GTG AAG GCC ATC AAC TTC CTG CCT GTG GAC TAT GAG ATC GAA TAT 192 
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Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu Tyr 
50 55 60 

GTG TGC CGA GGG GAG CGC GAG GTG GTG GGG CCC AAG GTG CGC AAA TGC 240 
Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys Cys 
65 70 75 80 

CTG GCC AAC GGC TCC TGG ACG GAT ATG GAC ACA CCC AGC CGC TGT GTC 2 88 

Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys Val 
85 90 95 

CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT TTC 33 6 

Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val Phe 
100 105 HO 

CTG ACG GGT GGG GAC CTC CCA GCT CTG GAT GGA GCC CGG GTG GAG TTC 384 
Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Glu Phe 
115 120 125 

CGA TGT GAC CCC GAC TTC CAT CTG GTG GGC AGC TCC CGG AGC GTC TGT 432 
Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser Val Cys 
130 135 140 

AGT CAG GGC CAG TGG AGC ACC CCC AAG CCC CAC TGC CAG GTG AAT CGA 480 
Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn Arg 
145 150 155 160 

ACG CCA CAC TCA GAA CGG CGT GCA GTA TAC ATC GGG GCG CTG TTT CCC 528 
Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe Pro 
165 170 175 

ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG 576 
Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu 
180 185 190 

ATG GCG CTG GAG GAC GTT AAC AGC CGC AGA GAC ATC CTG CCG GAC TAC 624 
Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr 
195 200 205 

GAG CTC AAG CTT ATC CAC CAC GAC AGC AAG TGT GAC CCA GGG CAA GCC 67 2 L 

Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala 
210 215 220 
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ACC AAG TAC TTG TAC GAA CTA CTC TAC AAT GAC CCC ATC AAG ATC ATT 
Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie 
225 230 235 240 



7:2-0 



5 CTC ATG CCT GGC TGT AGT TCT GTC TCC ACA CTT GTA GCT GAG GCT GCC 
Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala 
245 250 255 



768 



CGG ATG TGG AAC CTT ATT GTG CTC TCA TAT GGC TCC AGT TCA CCA GCC 
10 Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala 
260 265 270 



816 



TTG TCA AAC CGA CAG CGG TTT CCC ACG TTC TTC CGG ACG CAT CCA TCC 
Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser 
15 275 280 285 



864 
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GCC ACA CTC CAC AAT CCC ACC CGG GTG AAA CTC TTC GAA AAG TGG GGC 912 

Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly 
290 295 300 

TGG AAG AAG ATC GCT ACC ATC CAA CAG ACC ACC GAG GTC TTC ACC TCA 960 

Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser 
305 310 315 320 



25 ACG CTG GAT GAC CTG GAG GAG CGA GTG AAA GAG GCT GGG ATC GAG ATC 
Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie 
325 330 335 



1008 



ACT TTC CGA CAG AGT TTC TTC TCG GAT CCA GCT GTG CCT GTT AAA AAC 
30 Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn 
340 345 350 



1056 



CTG AAG CGT CAA GAT GCT CGA ATC ATC GTG GGA CTT TTC TAT GAG ACG 
Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr 
35 3 5 5 3 6 0 3 6 5 



1104 



40 



GAA GCC CGG AAA GTT TTT TGT GAG GTC TAT AAG GAA AGG CTC TTT GGG 1152 
Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly 
370 375 380 

AAG AAG TAC GTC TGG TTC CTC ATC GGG TGG TAT GCT GAC AAC TGG TTC 1200 
Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe 
385 390 395 400 
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AAG ACC TAT GAC CCG TCA ATC AAT TGT ACA GTG GAA GAA ATG ACC GAG 12^8 
Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Glu Glu Met Thr Glu 
405 410 415 

5 

GCG GTG GAG GGC CAC ATC ACC ACG GAG ATT GTC ATG CTG AAC CCT GCC 1296 
Ala Val Glu Gly His lie Thr Thr Glu He Val Met Leu Asn Pro Ala 
420 425 430 

10 AAC ACC CGA AGC ATT TCC AAC ATG ACG TCA CAG GAA TTT GTG GAG AAA 1344 
Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys 
435 440 445 

CTA ACC AAG CGG CTG AAA AGA CAC CCC GAG GAG ACT GGA GGC TTC CAG 1392 
15 Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin 
450 455 460 

GAG GCA CCA CTG GCC TAT GAT GCT ATC TGG GCC TTG GCT TTG GCC TTG 1440 
Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu 
20 465 470 475 • 480 

AAC AAG ACG TCT GGA GGA GGT GGT CGT TCC GGC GTG CGC CTG GAG GAC 1488 

Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp 
485 490 495 

25 

TTT AAC TAC AAC AAC CAG ACC ATT ACA GAC CAG ATC TAC CGG GCC ATG 153 6 

Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin He Tyr Arg Ala Met 
500 505 510 

30 AAC TCC TCC TCC TTT GAG GGC GTT TCT GGC CAT GTG GTC TTT GAT GCC 15 84 

Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala 
515 520 525 

AGC GGC TCC CGG ATG GCA TGG ACA CTT ATC GAG CAG CTA CAG GGC GGC 1632 
35 Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly 
530 535 540 

AGC TAC AAG AAG ATC GGC TAC TAC GAC AGC ACC AAG GAT GAT CTT TCC 1680 
Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser 
40 545 550 555 560 

TGG TCC AAA ACG GAC AAG TGG ATT GGA GGG TCT CCC CCA GCT ^ GAC CAG 1728 
Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin 
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ACC TTG GTC ATC AAG ACA TTC CGT 
Thr Leu Val lie Lys Thr Phe Arg 
5 580 



PCT/SE98/01947 

570 575 — 

TTC CTG TCT CAG AAA CTC TTT ATC 1776 
Phe Leu Ser Gin Lys Leu Phe lie 
585 590 



TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTT CTT GCT GTT GTC TGT 1824 

Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys 
595 600 605 

10 

CTG TCC TTT AAC ATC TAC AAC TCC CAC GTT CGT TAT ATC CAG AAC TCC 1872 

Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser 
610 615 620 



15 CAG CCC AAC CTG AAC AAT CTG ACT GCT GTG GGC TGC TCA CTG GCA CTG 1920 
Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu 
625 630 635 640 



GCT GCT GTC TTC CCT CTC GGG CTG GAT GGT TAC CAC ATA GGG AGA AGC 1968 
20 Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Ser 

645 650 655 



CAG TTC CCG TTT GTC TGC CAG GCC CGC CTT TGG CTC TTG GGC TTG GGC 2016 
Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly 
25 660 665 670 



TTT AGT CTG GGC TAT GGC 
Phe Ser Leu Gly Tyr Gly 
675 

30 

ACA GTC TTC ACG AAG AAG 
Thr Val Phe Thr Lys Lys 
690 



TCT ATG TTC ACC AAG ATC 
Ser Met Phe Thr Lys lie 
680 

GAG GAG AAG AAG GAG TGG 
Glu Glu Lys Lys Glu Trp 
695 700 



TGG TGG GTC CAC 2064 

Trp Trp Val His 

685 

AGG AAG ACC CTA 2112 
Arg Lys Thr Leu 



35 GAG CCC TGG AAA CTC TAT GCC ACT GTG GGC CTG CTG GTG GGC ATG GAT- 2160 
Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp 
705 710 715 720 

GTC CTG ACT CTT GCC ATC TGG CAG ATT GTG GAC CCC TTG CAC CGA ACC 2208 
40 Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr 

725 730 735 

ATT GAG ACT TTT GCC AAG GAG GAA CCA AAG GAA GAC ATC GAT GTC TCC 2256 
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He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser — 
740 745 750 

ATT CTG CCC CAG TTG GAG CAC TGC AGC TCC AAG AAG ATG AAT ACG TGG 23 04 

He Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp 
755 760 765 

CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC 2352 
Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He 
770 775 780 

TTT CTT GCT TAC GAA ACC AAG AGC GTG TCC ACT GAA AAG ATC AAT GAC 2400 
Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp 
785 790 795 800 

CAC AGG GCC GTG GGC ATG GCT ATC TAC AAT GTC GCG GTC CTG TGT CTC 2448 
His Arg *Ala Val Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu 
805 810 815 

ATC ACT GCT CCT GTG ACC ATG ATC CTT TCC AGT CAG CAG GAC GCA GCC 2496 
He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala 
820 825 830 

TTT GCC TTT GCC TCT CTG GCC ATC GTG TTC TCT TCC TAC ATC ACT CTG 2544 
Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu 
835 840 845 

GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG GAA 2592 
Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu 
850 855 860 

TGG CAG TCT GAA ACG CAG GAC ACC ATG AAA AC A GGA TCA TCC ACC AAC 2640 
Trp Gin Ser Glu Thr Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn 
865 870 875 880 

AAC AAC GAG GAA GAG AAG TCC CGA CTG TTG GAG AAG GAA AAC CGA GAA 2688 
Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu 
885 890 895 

CTG GAA AAG ATC ATC GCT GAG AAA GAG GAG CGC GTC TCT GAA CTG CGC 273 6 

Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg 
900 905 910 * 
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CAT CAG CTC CAG TCT CGG CAG CAA CTC CGC TCA CGG CGC CAC CCC CCA 278"4 
His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro 
915 920 925 

5 ACA CCC CCA GAT CCC TCT GGG GGC CTT CCC AGG GGA CCC TCT GAG CCC 2832 
Thr Pro Pro Asp Pro Ser Gly Gly Leu Pro Arg Gly Pro Ser Glu Pro 
930 935 940 

CCT GAC CGG CTT AGC TGT GAT GGG AGT CGA GTA CAT TTG CTT TAC AAG 2880 
10 Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
945 950 955 960 

TGA 2883 
* 

15 

(2) INFORMATION FOR SEQ ID NO : 45: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 960 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Leu Leu Leu Leu Leu Val Pro Leu Phe Leu Arg Pro Leu Gly Ala 
15 10 15 

Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie lie 
20 25 30 

His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg Asp 
35 3 5 4 0 4 5 

Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu Tyr 
50 55 60 

40 Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys Cys :j 
65 70 75 80 

Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys Val 
HI 865-1 WO SEQ 23 
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85 90 ' 95 — 

Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val Phe 
100 105 HO 

5 

Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Glu Phe 
115 120 125 

Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser Val Cys 
10 13 0 135 140 

Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn Arg 
145 150 155 160 

15 Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe Pro 

165 170 175 

Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu 
180 185 190 

20 

Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr 
195 200 205 

Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala 
25 210 215 220 

Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He 
225 230 235 240 

30 Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala 

245 250 255 

Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala 
260 265 270 

35 

Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser 
275 280 285 

Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly 
40 290 295 300 

Trp Lys Lys He Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser 
305 310 315 320 



H1865-1 WO SEQ 24 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 



Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie 
325 330 335 

5 Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn 
340 345 350 

Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr 
355 360 365 

10 

Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly 
370 375 380 

Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe 
15 385 390 395 400 

Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Glu Glu Met Thr Glu 
405 410 415 

20 Ala Val Glu Gly His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala 
420 425 430 

Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys 
435 440 445 

25 

Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin 
450 455 460 

Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu 
30 465 470 475 480 

Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp 
485 490 495 

35 Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met- 
500 505 510 

Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala 
515 520 525 

40 

Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly 
530 535 540 
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Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser ~ 
545 550 555 560 

Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin 
565 570 575 

Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie 
580 585 590 

Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys 
595 600 605 

Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser 
610 615 620 

Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu 
625 630 635 640 

Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Ser 
645 650 655 

Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly 
660 665 670 

Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His 
675 680 685 

Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu 
690 695 700 

Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp 
705 710 715 720 

Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr 
725 730 735 - 

lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser 
740 745 750 

He Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp 
755 760 765 

Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He 
HI 865-1 WO SEQ 26 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



770 775 780 

Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys lie Asn Asp 
785 790 795 800 

5 

His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu 
805 810 815 

He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala 
10 820 825 830 

Phe Ala Phe Ala Ser Leu Ala lie Val Phe Ser Ser Tyr He Thr Leu 
835 840 845 

15 Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu 
850 855 860 

Trp Gin Ser Glu Thr Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn 
865 . 870 875 880 

20 

Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu 
885 890 895 

Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg 
25 900 905 910 

His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro 
915 920 925 

30 Thr Pro Pro Asp Pro Ser Gly Gly Leu Pro Arg Gly Pro Ser Glu Pro 
930 935 940 

Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
945 950 955 960 

35 

(2) INFORMATION FOR SEQ ID NO : 46: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 253 8 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE : NO 
(vi) ORIGINAL SOURCE: 

^ (A) ORGANISM: Rattus norwegicus 

(ix) FEATURE: 

<A) NAME / KEY : CDS 
(B) LOCATION: 1 . .2532 

(X) PUBLICATION INFORMATION: 

(A) AUTHORS: Kaupmann, K 

Huggel , K 
Heid, J 
Flor, P 
Bischoff, M 
Mickel , S 
McMaster, G 
Angst, C 
Bittiger, H 
Froestl, W 

(B) TITLE: Expression cloning of GABA-B receptors 

uncovers similarity to metabotropic glutamate 
receptors 

(C) JOURNAL: Nature 

(D) VOLUME: 3 86 

(F) PAGES: 239-246 

(G) DATE: 20 March-1997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATG GGC CCG GGG GGA CCC TGT ACC CCA GTG GGG TGG CCG CTG CCT CTT 48 
Met Gly Pro Gly Gly Pro Cys Thr Pro Val Gly Trp Pro Leu Pro Leu 
1 5 10 15 

CTG CTG GTG ATG GCG GCT GGG GTG GCT CCG GTG TGG GCC TCT CAC TCC 96^ 
Leu Leu Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 3(f 
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CCT CAT CTC CCG CGG CCT CAC CCG AGG GTC CCC CCG CAC CCC TCC TCA H4 
Pro His Leu Pro Arg Pro His Pro Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

5 GAA CGG CGT GCA GTA TAC ATC GGG GCG CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

TGG CCG GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 240 
10 Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

GAC GTT AAC AGC CGC AGA GAC ATC CTG CCG GAC TAC GAG CTC AAG CTT 288 
Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
15 85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAC CCA GGG CAA GCC ACC AAG TAC TTG 336 
lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

20 

TAC GAA CTA CTC TAC AAT GAC CCC ATC AAG ATC ATT CTC ATG CCT GGC 384 
Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

25 TGT AGT TCT GTC TCC ACA CTT GTA GCT GAG GCT GCC CGG ATG TGG AAC 432 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

CTT ATT GTG CTC TCA TAT GGC TCC AGT TCA CCA GCC TTG TCA AAC CGA 480 
30 Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

CAG CGG TTT CCC ACG TTC TTC CGG ACG CAT CCA TCC GCC ACA CTC CAC 52 8 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
35 165 170 175 

AAT CCC ACC CGG GTG AAA CTC TTC GAA AAG TGG GGC TGG AAG AAG ATC 576 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 



40 



GCT ACC ATC CAA CAG ACC ACC GAG GTC TTC ACC TCA ACG CTG GAT GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

HI 865-1 WO SEQ 29 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 992189QA1 I > 



WO 99/21890 PCT/SE98/01947 



CTG GAG GAG CGA GTG AAA GAG GCT GGG ATC GAG ATC ACT TTC CGA GAG 672 
Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

AGT TTC TTC TCG GAT CCA GCT GTG CCT GTT AAA AAC CTG AAG CGT CAA 720 
Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

GAT GCT CGA ATC ATC GTG GGA CTT TTC TAT GAG ACG GAA GCC CGG AAA 768 
Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

GTT TTT TGT GAG GTC TAT AAG GAA AGG CTC TTT GGG AAG AAG TAC GTC 816 
Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

TGG TTC CTC ATC GGG TGG TAT GCT GAC AAC TGG TTC AAG ACC TAT GAC 864 
Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys Thr Tyr Asp 
275 280 285 

CCG TCA ATC AAT TGT ACA GTG GAA GAA ATG ACC GAG GCG GTG GAG GGC 912 
Pro Ser He Asn Cys Thr Val Glu Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

CAC ATC ACC ACG GAG ATT GTC ATG CTG AAC CCT GCC AAC ACC CGA AGC 960 
His He Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

ATT TCC AAC ATG ACG TCA CAG GAA TTT GTG GAG AAA CTA ACC AAG CGG 
He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

CTG AAA AGA CAC CCC GAG GAG ACT GGA GGC TTC CAG GAG GCA CCA CTG 1056 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Le\* 
340 345 350 

GCC TAT GAT GCT ATC TGG GCC TTG GCT TTG GCC TTG AAC AAG ACG TCT 1104 
Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

GGA GGA GGT GGT CGT TCC GGC GTG CGC CTG GAG GAC TTT AAC TAC AAC 1152 
Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
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370 375 380 

AAC CAG ACC ATT ACA GAC CAG ATC TAC CGG GCC ATG AAC TCC TCC TCC 1200 
Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

TTT GAG GGC GTT TCT GGC CAT GTG GTC TTT GAT GCC AGC GGC TCC CGG 1248 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

ATG GCA TGG ACA CTT ATC GAG CAG CTA CAG GGC GGC AGC TAC AAG AAG 1296 
Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

ATC GGC TAC TAC GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACG 1344 
lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

GAC AAG TGG ATT GGA GGG TCT CCC CCA GCT GAC CAG ACC TTG GTC ATC 1392 
Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val lie 
450 455 460 

AAG ACA TTC CGT TTC CTG TCT CAG AAA CTC TTT ATC TCC GTC TCA GTT 1440 
Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
465 470 475 480 

CTC TCC AGC CTG GGC ATT GTT CTT GCT GTT GTC TGT CTG TCC TTT AAC 1488 
Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

ATC TAC AAC TCC CAC GTT CGT TAT ATC CAG AAC TCC CAG CCC AAC CTG 1536 
lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu 
500 505 510 

AAC AAT CTG ACT GCT GTG GGC TGC TCA CTG GCA CTG GCT GCT GTC TTC 15 84 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 * 

CCT CTC GGG CTG GAT GGT TAC CAC ATA GGG AGA AGC CAG TTC CCG TTT 1632 
Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Ser Gin Phe Pro Phe 
530 535 540 

GTC TGC CAG GCC CGC CTT TGG CTC TTG GGC TTG GGC TTT AGT CTG GGC 1680 
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Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Giy ~ 
545 550 555 560 

TAT GGC TCT ATG TTC ACC AAG ATC TGG TGG GTC CAC ACA GTC TTC ACG 1728 
5 Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 

565 570 575 

AAG AAG GAG GAG AAG AAG GAG TGG AGG AAG ACC CTA GAG CCC TGG AAA 1776 
Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
10 580 585 590 

CTC TAT GCC ACT GTG GGC CTG CTG GTG GGC ATG GAT GTC CTG ACT CTT 1824 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

GCC ATC TGG CAG ATT GTG GAC CCC TTG CAC CGA ACC ATT GAG ACT TTT 1872 
Ala He Trp Gin He Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
610 615 620 

GCC AAG GAG GAA CCA AAG GAA GAC ATC GAT GTC TCC ATT CTG CCC CAG 1920 
Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
625 630 635 640 

TTG GAG CAC TGC AGC TCC AAG AAG ATG AAT ACG TGG CTT GGC ATT TTC 1968 
Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp Leu Gly He Phe 
645 650 655 

TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC TTT CTT GCT TAC 2016 
Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
660 665 670 

GAA ACC AAG AGC GTG TCC ACT GAA AAG ATC AAT GAC CAC AGG GCC GTG 2064 
Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
675 680 685 

GGC ATG GCT ATC TAC AAT GTC GCG GTC CTG TGT CTC ATC ACT GCT CCT 2112 
Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
690 695 700 

40 GTG ACC ATG ATC CTT TCC AGT CAG CAG GAC GCA GCC TTT GCC TTT GCC 2160 
Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 ' 720 
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TCT CTG GCC ATC 
Ser Leu Ala lie 

5 GTG CCC AAG ATG 
Val Pro Lys Met 
740 

ACG CAG GAC ACC 
10 Thr Gin Asp Thr 
755 

GAG AAG TCC CGA 
Glu Lys Ser Arg 
15 770 

ATC GCT GAG AAA 
lie Ala Glu Lys 
785 

20 

TCT CGG CAG CAA 
Ser Arg Gin Gin 

25 CCC TCT GGG GGC 
Pro Ser Gly Gly 
820 

AGC TGT GAT GGG 
30 Ser Cys Asp Gly 
835 



GTG TTC TCT TCC TAC 
Val Phe Ser Ser Tyr 
725 

CGC AGG CTG ATC ACC 
Arg Arg Leu lie Thr 
745 

ATG AAA ACA GGA TCA 
Met Lys Thr Gly Ser 
760 

CTG TTG GAG AAG GAA 
Leu Leu Glu Lys Glu 
775 

GAG GAG CGC GTC TCT 
Glu Glu Arg Val Ser 
. 790 

CTC CGC TCA CGG CGC 
Leu Arg Ser Arg Arg 
805 

CTT CCC AGG GGA CCC 
Leu Pro Arg Gly Pro 
825 

AGT CGA GTA CAT TTG 
Ser Arg Val His Leu 
840 



ATC ACT CTG GTT GTG 
lie Thr Leu Val Val 
730 

CGA GGG GAA TGG CAG 
Arg Gly Glu Trp Gin 
750 

TCC ACC AAC AAC AAC 
Ser Thr Asn Asn Asn 
765 

AAC CGA GAA CTG GAA 
Asn Arg Glu Leu Glu 
780 

GAA CTG CGC CAT CAG 
Glu Leu Arg His Gin 
795 

CAC CCC CCA ACA CCC 
His Pro Pro Thr Pro 
810 

TCT GAG CCC CCT GAC 
Ser Glu Pro Pro Asp 
830 

CTT TAC AAG TGA 
Leu Tyr Lys * 



PCTYSE98/01947 

CTC TTT 22-08 

Leu Phe 

735 

TCT GAA 2256 
Ser Glu 



GAG GAA 2304 
Glu Glu 

AAG ATC 23 52 

Lys lie 

CTC CAG 2400 
Leu Gin 
800 

CCA GAT 2448 

Pro Asp 

815 

CGG CTT 2496 
Arg Leu 

2535 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
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Met Gly Pro Gly Gly Pro Cys Thr Pro Val Gly Trp Pro Leu Pro Leu 
1 5 10 15 

5 Leu Leu Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

Pro His Leu Pro Arg Pro His Pro Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

10 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
15 65 70 75 80 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

20 lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie He Leu Met Pro Gly 
115 120 125 

25 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
30 145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 

35 Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie- 
180 185 190 

Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

40 

Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He Thr Phe Arg Gin 
210 215 220 
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Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
5 245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

10 Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys Thr Tyr Asp 
275 280 285 

Pro Ser lie Asn Cys Thr Val Glu Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

15 

His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
20 325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

25 Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

30 

Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
35 405 410 415 

Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

40 lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val lie 
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450 455 460 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
- 465 470 475 480 

5 

Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

He T^rr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
10 500 505 510 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

15 Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Ser Gin Phe Pro Phe 
530 535 540 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

20 

Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His Thr Val Phe Thr 
565 570 575 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
25 580 585 590 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

30 Ala He Trp Gin He Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
610 615 620 

Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
625 630 635 640 

35 

Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp Leu Gly He Phe 
645 650 655 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr 
40 660 665 670 

Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
675 680 685 
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Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
690 695 700 

5 Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 720 

Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu Val Val Leu Phe 
725 730 735 

10 

Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu Trp Gin Ser Glu 
740 745 750 

Thr Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
15 755 760 765 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys He 
770 775 780 

20 He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
785 790 795 800 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Asp 
805 810 815 

25 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Ser Glu Pro Pro Asp Arg Leu 
820 825 830 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
30 835 840 



(2). INFORMATION FOR SEQ ID NO: 48: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2886 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



H1865-1 WO SEQ 37 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 



(iv) ANTI-SENSE: NO 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .2886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



48 



96 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 2 88 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 - 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 3 36 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 HO 

TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 3 84 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 
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TTC CGG TGT GAC CCC GAC TTC CAT CTG GTG GGC AGC TCC CGG AGC ATC 432 

Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser lie 

130 135 140 

5 

TGT AGT CAG GGC CAG TGG AGC ACC CCC AAG CCC CAC TGC CAG GTG AAT 480 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

10 CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT 528 

Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe 
165 170 175 



CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG 576 

15 Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC 624 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp tie Leu Pro Asp 
20 - 195 200 205 



TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA 672 

Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 

GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC 72 0 

Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 



30 ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT 768 
lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 



GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA 816 
35 Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 



GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA 864 
Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
40 275 280 285 w 

TCA GCC AC A CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG 912 
Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
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290 295 300 

GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT 9 60 

Gly Trp Lys Lys lie Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG 1008 
Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu 
- 325 330 335 

ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA 1056 
He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG 1104 
Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
355 360 365 

ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT 1152 
Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG 1200 
Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
385 390 395 400 

TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC AC A GTG GAT GAG ATG ACT 1248 
Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT 1296 
Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG 1344 
Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC 1392 
Lys Leu Thr Lys Arg Leu Lys Arg His Pro . Glu Glu Thr Gly Gly Phe » 
450 455 460 

CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC 1440 

» 

H1865-1 WOSEQ 40 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala 
465 470 475 480 

CTG AAC AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG 1488 
5 Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu 

485 490 495 

GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA 153 6 

Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala 
10 500 - 505 510 

ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT 1584 

Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp 
515 520 525 

15 

GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT 1632 

Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly 
530 535 540 

20 GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT 1680 
Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC 1728 
25 Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp 

565 570 575 

CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT 1776 
Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe 
30 580 585 590 

ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC 1824 

lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala Val Val 
595 600 605 

35 

TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC 1872 

Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn 
610 615 620 

40 TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT 1920:, 
Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala 
625 630 635 ' 640 
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TTA GCT GCT GTC 
Leu Ala Ala Val 

5 AAC CAG TTT CCT 
Asn Gin Phe Pro 
660 

GGC TTT AGT CTG 
10 Gly Phe Ser Leu 
675 

CAC ACG GTC TTC 
His Thr Val Phe 
15 690 

CTG GAA CCC TGG 
Leu Glu Pro Trp 
705 

20 

GAT GTC CTC ACT 
Asp Val Leu Thr 

25 ACC ATT GAG ACA 
Thr lie Glu Thr 
740 

TCT ATT CTG CCC 
30 Ser lie Leu Pro 
755 

TGG CTT GGC ATT 
Trp Leu Gly lie 
35 770 

ATC TTC CTT GCT 
lie Phe Leu Ala 
785 

40 

GAT CAC CGG GCT 
Asp His Arg Ala 



TTC CCC CTG GGG CTC 
Phe Pro Leu Gly Leu 
645 

TTC GTC TGC CAG GCC 
Phe Val Cys Gin Ala 
665 

GGC TAC GGT TCC ATG 
Gly Tyr Gly Ser Met 
680 

ACA AAG AAG GAA GAA 
Thr Lys Lys Glu Glu 
695 

AAG CTG TAT GCC ACA 
Lys Leu Tyr Ala Thr 
710 

CTC GCC ATC TGG CAG 
Leu Ala lie Trp Gin 
725 

TTT GCC AAG GAG GAA 
Phe Ala Lys Glu Glu 
745 

CAG CTG GAG CAT TGC 
Gin Leu Glu His Cys 
760 

TTC TAT GGT TAC AAG 
Phe Tyr Gly Tyr Lys 
775 

TAT GAG ACC AAG AGT 
Tyr Glu Thr Lys Ser 
790 

GTG GGC ATG GCT ATC 
Val Gly Met Ala lie 
805 



GAT GGT TAC CAC ATT 
Asp Gly Tyr His lie 
650 

CGC CTC TGG CTC CTG 
Arg Leu Trp Leu Leu 
670 

TTC ACC AAG ATT TGG 
Phe Thr Lys lie Trp 
685 

AAG AAG GAG TGG AGG 
Lys Lys Glu Trp Arg 
700 

GTG GGC CTG CTG GTG 
Val Gly Leu Leu Val 
715 

ATC GTG GAC CCT CTG 
lie Val Asp Pro Leu 
730 

CCT AAG GAA GAT ATT 
Pro Lys Glu Asp lie 
750 

AGC TCC AGG AAG ATG 
Ser Ser Arg Lys Met 
765 

GGG CTG CTG CTG CTG 
Gly Leu Leu Leu Leu 
780 

GTG TCC ACT GAG AAG 
Val Ser Thr Glu Lys 
795 

TAC AAT GTG GCA GTC 
Tyr Asn Val Ala Val 
810 



PCT/SE98/01947 

GGG AGG 19^8 
Gly Arg 
655 

GGC CTG 2 016 

Gly Leu 

TGG GTC 2064 
Trp Val 

AAG ACT 2112 
Lys Thr 

GGC ATG 2160 
Gly Met 
720 

CAC CGG 2208 
His Arg 
735 

GAC GTC 2256 
Asp Val 

AAT ACA 23 04 

Asn Thr 

CTG GGA 2352 
Leu Gly 

ATC AAT 2400 
lie Asn 
800 

CTG TGC 2448 
Leu Cys 
815 
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CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC AGC CAG CAG GAT GCA 2496 

Leu lie Thr Ala Pro Val Thr Met lie Leu Ser Ser Gin Gin Asp Ala 

820 825 830 

5 

GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC TCC TCC TAT ATC ACT 2544 

Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr 

835 840 845 

10 CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG 2592 
Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly 
850 855 860 

GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA TCG ACC 2640 
15 Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr 
865 870 875 880 

AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG AAC CGT 2688 
Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg 
20 885 890 895 

GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT GAA CTG 2736 

Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu 

900 905 910 

25 

CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC CAC CCA 2784 

Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro 

915 920 925 

30 CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC CCT GAG 2832 
Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu 
930 935 940 

CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG CTT TAT 2880 
35 Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr- 
945 950 955 960 

AAG TGA 2886 
Lys * 

40 



(2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

^(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

10 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1-5 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
IS 20 25 30 

lie His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

20 Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

25 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
30 100 105 HO 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

35 Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser lie 
130 135 140 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

40 

Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe 
165 170 ' 175 
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Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp 
5 195 200 205 

Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 

10 Ala Thr Lys Tyr Leu Tyr ^ Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 

lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 

15 

Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 

Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
20 275 280 285 

Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 

25 Gly Trp Lys Lys lie Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu 
325 330 335 

30 

He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
35 355 360 365 

Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

40 Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
385 390 395 400 

Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
HI 865-1 WO SEQ 45 
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410 
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415 



Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met Leu Asn Pro 
420 425 430 

5 

Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
10 450 455 460 

Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala 
465 470 475 480 

15 Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu 

485 490 495 

Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala 
500 505 53,0 

20 

Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp 

515 520 525 

Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly 
25 530 535 540 

Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

30 Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp 

565 570 575 

Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe 
580 585 590 

35 

He Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu Ala Val Val 
595 600 605 

Cys Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He Gin Asn 
40 610 615 620 

Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala 
625 630 635 640 
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Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg 
645 650 655 

5 Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu 
660 665 670 

Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val 
675 680 685 

10 

His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr 
690 695 700 

Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met 
15 705 710 715 720 

Asp Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu His Arg 
725 730 735 

20 Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val 
740 745 750 

Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr 
755 760 765 

25 

Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly 
770 775 780 

He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn 
30 785 790 795 800 

Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala Val Leu Cys 
805 810 815 

35 Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp ALa 
820 825 830 

Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr 
835 840 845 

40 

Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie Thr Arg Gly 
850 855 860 
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Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr 
865 870 875 880 

Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg 
885 890 895 

Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser Glu Leu 
900 905 910 

Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro 
915 920 925 

Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu 
930 935 940 

Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr 
945 950 955 960 

Lys 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 5 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to rRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION : 1 . .2535 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



ATG GGG CCC GGG GCC CCT TTT GCC CGG GTG GGG TGG CCA CTG CCG CTT 48 
Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
5 1 5 10 15 



10 



CTG GTT GTG ATG GCG GCA GGG GTG GCT CCG GTG TGG GCC TCC CAC TCC 96 
Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

CCC CAT CTC CCG CGG CCT CAC TCG CGG GTC CCC CCG CAC CCC TCC TCA 144 
Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 



IS GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 



TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 240 
20 Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 



GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 288 
Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
25 85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 336 

lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

30 

TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC 3 84 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 



35 TGC AGC TCT GTC TCC ACG CTG GTG GCT 
Cys Ser Ser Val Ser Thr Leu Val Ala 
130 135 

CTC ATT GTG CTT TCC TAT GGC TCC AGC 
40 Leu lie Val Leu Ser Tyr Gly Ser Ser 
145 150 

CAG CGT TTC CCC ACT TTC TTC CGA ACG 



GAG GCT GCT AGG ATG TGG AAC 432 
Glu Ala Ala Arg Met Trp Asn 
140 

TCA CCA GCC CTG TCA AAC CGG 480 
Ser Pro Ala Leu Ser Asn Arg 
155 160 

CAC CCA TCA GCC ACA CTC CAC 528 
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Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His — 
165 170 175 

AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 576 
5 Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
10 195 200 205 

CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 672 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

15 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 720 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 

225 230 235 240 

20 GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 768 
Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 816 
25 Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 864 
Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp 
30 275 280 285 

CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC 912 
Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

35 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 960 
His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

40 ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG. GAG AAA CTA ACC AAG CGA 1008 
He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 *335 
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CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 



105-6 



GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 
Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 



1104 



GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 
Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 



1152 



AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC 
Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 



1200 



TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
40.5 410 415 



1248 



ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 
Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 



1296 



ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 
lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 



1344 



GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC 
Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
450 455 460 



1392 



AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT 
Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He Ser Val Ser Val 
465 470 475 4&0 



1440 



CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC 
Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 



1488 



ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG 
He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro*Asn Leu 
500 505 510 



1536 
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AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC 1584 
Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

CCC CTG GGG CTC GAT GGT TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC 1632 
Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Asn Gin Phe Pro Phe 
530 535 540 

GTC TGC CAG GCC CGC CTC TGG CTC CTG GGC CTG GGC TTT AGT CTG GGC 1680 
Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

TAC GGT TCC ATG TTC ACC AAG ATT TGG TGG GTC CAC ACG GTC TTC ACA 1728 
Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His Thr Val Phe Thr 
565 570 575 

AAG AAG GAA GAA AAG AAG GAG TGG AGG AAG ACT CTG GAA CCC TGG AAG 1776 
Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
580 585 590 

CTG TAT GCC ACA GTG GGC CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC 1824 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

GCC ATC TGG CAG ATC GTG GAC CCT CTG CAC CGG ACC ATT GAG ACA TTT 1872 
Ala He Trp Gin He Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
610 615 620 

GCC AAG GAG GAA CCT AAG GAA GAT ATT GAC GTC TCT ATT CTG CCC CAG 1920 
Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
625 630 635 640 

CTG GAG CAT TGC AGC TCC AGG AAG ATG AAT ACA TGG CTT GGC ATT TTC 1968 
Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly He Phe- 
645 650 655 

TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC TTC CTT GCT TAT 2016 
Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
660 66 5 670 



GAG ACC AAG AGT GTG TCC ACT GAG AAG ATC AAT GAT CAC CGG GCT GTG 
Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 



2064 
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675 680 685 — 

GGC ATG GCT ATC TAC AAT GTG GCA GTC CTG TGC CTC ATC ACT GCT CCT 2112 
Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu lie Thr Ala Pro 
5 690 695 700 

GTC ACC ATG ATT CTG TCC AGC CAG CAG GAT GCA GCC TTT GCC TTT GCC 2160 

Val Thr Met lie Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 

705 710 715 720 - 

10 

TCT CTT GCC ATA GTT TTC TCC TCC TAT ATC ACT CTT GTT GTG CTC TTT 2208 

Ser Leu Ala lie Val Phe Ser Ser Tyr lie Thr Leu Val Val Leu Phe 

725 730 735 

15 GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG GAA TGG CAG TCG GAG 2256 
Val Pro Lys Met Arg Arg Leu lie Thr Arg Gly Glu Trp Gin Ser Glu 
740 745 750 

GCG CAG GAC ACC ATG. AAG ACA GGG TCA TCG ACC AAC AAC AAC GAG GAG 23 04 

20 Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
755 760 765 

GAG AAG TCC CGG CTG TTG GAG AAG GAG AAC CGT GAA CTG GAA AAG ATC 2352 
Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys lie 
25 770 775 780 

ATT GCT GAG AAA GAG GAG CGT GTC TCT GAA CTG CGC CAT CAA CTC CAG 2400 
lie Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
785 790 795 800 

30 

TCT CGG CAG CAG CTC CGC TCC CGG CGC CAC CCA CCG ACA CCC CCA GAA 2448 
Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
805 810 815 

35 CCC TCT GGG GGC CTG CCC AGG GGA CCC CCT GAG CCC CCC GAC CGG CTT- 2496 
Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
820 825 830 

AGC TGT GAT GGG AGT CGA GTG CAT TTG CTT TAT AAG TGA 253 5 

40 Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys * 
835 840 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 amino acids 
5 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
15 10 15 

15 Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

20 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
25 65 70 75 80 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

30 lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

35 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
40 145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 
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Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp 
275 280 285 

Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 ' 400 
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Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg ~ 
405 410 415 

Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
450 455 460 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
465 470 475 480 

Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
500 505 510 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Asn Gin Phe Pro Phe 
530 535 540 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 ' 555 560 

Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His Thr Val Phe Thr 
565 570 575 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
580 585 590 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

Ala He Trp Gin He Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
610 615 620 

Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
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625 630 635 640 — 

Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly lie Phe 
645 650 655 

5 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr 
660 665 670 

Glu Thr Lys Ser Val Ser Thr Glu Lys lie Asn Asp His Arg Ala Val 
10 675 680 685 

Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu lie Thr Ala Pro 
690 695 700 

15 Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 720 

Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu Val Val Leu Phe 
725 730 735 

20 

Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu Trp Gin Ser Glu 
740 745 750 

Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
25 755 760 765 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys He 
770 775 780 

30 He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
785 790 795 800 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
805 810 815 

35 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
820 825 830 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
40 835 840 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) . ORGANISM: Canis familiaris 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATIONil. .2895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



ATG CTG CTG CTG CTC CTG CCG CTG GCG CTG GCG CCG CTC TTC CTC CGC 48 
Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
15 10 15 

CCC CCG GGC GCG GGC GGG GCA CAG ACC CCC AAC GCC ACC TCG GAA GGT 96 
Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
20 25 30 

TGC CAG ATC ATA CAC CCG CCT TGG GAA GGG GGT ATC AGG TAC AGG GGC 144 
Cys Gin lie lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly 
35 40 45 

CTG ACT CGT GAC CAG GTG AAG GCT ATC AAC TTC CTG CCG GTG GAC TAT" 192 
Leu Thr Arg Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr 
50 55 60 

GAG ATT GAG TAT GTG TGC CGG GGA GAG CGA GAG GTG GTG GGG CCC AAG 240 
Glu lie Glu Tyr Val Cys Arg Gly Glu Arg. Glu Val Val Gly Pro Lys 
65 70 75 80 



GTC CGA AAG TGC CTG GCC AAT GGC TCC TGG ACA GAT ATG GAC ACA CCC 
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Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro — 
85 90 95 

AGC CGC TGT GTC CGA ATC TGT TCC AAG TCA TAT TTG GCC CTG GAA AAT 33 6 

5 Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

GGG AAG GTC TTC CTG ACG GGT GGG GAC CTC CCC GCT CTG GAT GGA GCC 3 84 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
10 115 - 120 125 

CGG GTG GAT TTC CGG TGT GAC CCT GAC TTC CAT CTT GTG GGC AGC TCC 432 

Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 

15 

CGG AGT ATC TGT AGT CAG GGC CAG TGG AGC ACT CCC AAG CCC CAC TGC 480 

Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

20 CAG GTG AGC CGA ACG CCG CAC TCA GAG CGG CGA GCG GTG TAC ATC GGG 528 
Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly 
165 170 175 

GCG CTG TTT CCC ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG 576 
25 Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC 624 
Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie 
30 195 200 205 

CTG CCG GAC TAC GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAC 672 

Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp 
210 215 220 

35 

CCA GGC CAA GCT ACC AAG TAC CTG TAT GAA CTG CTC TAC AAC GAC CCC 720 

Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
" 225 230 235 240 

40 ATC AAG ATC ATC CTC ATG CCT GGC TGC AGC TCT GTC TCC ACG CTT GTG 768., 
lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
245 250 *255 



H1865-1 WOSEQ 59 

SUBSTITUTE SHEET (RULE 26) 



10 



WO 99/21890 PCT/SE98/01947 

GCT GAG GCT GCC AGG ATG TGG AAC CTC ATT GTG CTC TCC TAT GGT TCC 8*6 
Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser 
260 265 270 

AGC TCA CCA GCT CTG TCC AAC CGG CAG CGC TTT CCT ACC TTC TTC CGA 864 
Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
275 280 285 

ACT CAT CCC TCG GCC ACG CTC CAC AAC CCT ACG CGA GTG AAG CTC TTT 912 
Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
290 295 300 



GAG AAG TGG GGC TGG AGG AAG ATT GCC ACC ATC CAG CAG ACC ACC GAG 960 
Glu Lys Trp Gly Trp Arg Lys He Ala Thr He Gin Gin Thr Thr Glu 
,5 305 310 315 320 

GTG TTC ACA TCG ACT CTG GAC GAC CTA GAG GAA CGA GTG AAG GAG GCT 1008 
Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala 
325. 330 335 

20 

GGG ATT GAG ATT ACT TTC CGC CAG AGC TTC TTC TCA GAT CCT GCC GTG 1056 
Gly He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 
340 345 350 

25 CCT GTC AAG AAC CTC AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT 1104 
Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu 
355 360 365 

TTC TAT GAG ACT GAA GCC CGG AAA GTG TTC TGT GAG GTA TAC AAG GAG 1152 
30 Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu 
370 375 380 

CGG CTC TTT GGG AAG AAG TAT GTG TGG TTC CTC ATT GGG TGG TAT GCT 12 00 

Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala 
35 385 390 395 400^ 

GAC AAT TGG TTC AAG ACC TAC GAC CCC TCC ATC AAC TGC ACA GTG GAT 1248 
Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser He Asn Cys Thr Val Asp 
405 410 415 



40 



GAG ATG ACC GAG GCT GTG GAA GGC CAC ATC ACC ACT GAG ATT GTC ATG 1296 
Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met 
420 425 430 
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CTG AAC CCA GCC AAC ACC CGC AGC ATC TCC AAC ATG ACA TCC CAG GAG 1344 

Leu Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu 

435 440 445 

5 

TTT GTG GAG AAA CTG ACC AAG AGA CTC AAG AGA CAC CCT GAG GAG ACA 1392 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 

450 455 460 

10 GGC GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG 1440 
Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu 
465 470 475 480 

GCA TTG GCC CTG AAC AAG ACA TCT GGA GGG AGC GGC CGT TCG GGG GTG 1488 
15 Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 

485 490 495 

CGC CTG GAA GAC TTC AAC TAC AAC AAC CAG ACG ATC ACA GAC CAA ATC 153 6 

Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie 
20 500 505 510 

TAC CGC GCA ATG AAC TCC TCG TCC TTT GAG GGT GTC TCT GGC CAC GTG 1584 
Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

25 

GTG TTT GAT GCC AGC GGC TCA CGG ATG GCC TGG ACT CTG ATT GAG CAG 1632 
Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin 
530 535 540 

30 CTG CAG GGT GGC AGC TAC AAG AAG ATC GGC TAC TAT GAC AGC ACC AAG 1680 
Leu Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

GAT GAC CTT TCC TGG TCT AAA ACG GAC AAA TGG ATT GGA GGG GCC CCC 1728 
35 Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ala Prcr 

565 570 575 

CCG GCC GAC CAG ACC CTG GTC ATC AAG ACA TTT CGC TTC ATG TCA CAG 1776 
Pro Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Met Ser Gin 
40 580 585 590 

AAG CTC TTC ATT TCA GTC TCT GTC CTC TCC AGC CTG GGC ATT* GTC CTG 1824 
Lys Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu 
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GCT GTG GTC TGT CTG TCC TTT AAC ATC TAC AAC TCT CAT GTC CGT TAG 1872 
Ala Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr 
610 615 620 

ATC CAG AAC TCC CAG CCC AAC TTG AAC AAT CTG ACT GCT GTG GGC TGC 1920 
lie Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys 
625 ° 630 635 640 

TCC CTG GCA TTG GCT GCC GTC TTC CCC CTG GGG CTA GAT GGG TAC CAC 1968 
Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His 
645 650 655 

ATC GGG AGA AGC CAG TTT CCT TTT GTG TGT CAG GCA CGC CTC TGG CTC 2016 
lie Gly Arg Ser Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu 
660 665 670 

CTG GGT CTG GGC TTC. AGT CTG GGC TAT GGC TCC ATG TTC ACG AAG ATC 2064 
Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie 
675 680 685 

TGG TGG GTC CAC ACG GTC TTC ACT AAG AAG GAG GAG AAG AAG GAG TGG 2112 
Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp 
690 695 700 

AGG AAG ACC CTG GAG CCC TGG AAG CTG TAC ACC ACA GTG GGC TTG CTA 2160 
Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Thr Thr Val Gly Leu Leu 
705 710 715 720 

GTG GGC ATG GAT GTC CTC ACT CTT GCC ATT TGG CAG ATG GTA GAC CCC 2208 
Val Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin Met Val Asp Pro 
725 730 735 

TTG CAC CGG ACC ATT GAG ACT TTT GCC AAG GAG GAA CCA AAG GAA GAT- 2256 
Leu His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp 
740 745 750 

ATT GAT GTG TCC ATC CTG CCC CAG CTG GAG CAC TGC AGC TCC AAG AAA 2304 
lie Asp Val Ser lie Leu Pro Gin Leu Glu. His Cys Ser Ser Lys Lys 
755 760 765 

ATG AAC ACC TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG 2352 
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Met Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leii ~ 
770 775 780 

CTG CTA GGC ATC TTT CTT GCT TAT GAG ACC AAG AGC GTG TCT ACT GAG 2400 
5 Leu Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu 
785 790 795 800 

AAG ATC AAT GAC CAC CGG GCT GTG GGC ATG GCC ATG TAC AAC GTG GCG 2448 
Lys lie Asn Asp His Arg Ala Val Gly Met Ala Met Tyr Asn Val Ala 
10 805 ' 810 815 

GTC CTG TGC CTC ATC ACT GCC CCG GTC ACC ATG ATC CTG TCC AGC CAG 2496 

Val Leu Cys Leu lie Thr Ala Pro Val Thr Met lie Leu Ser Ser Gin 
820 825 830 

15 

CAG GAT GCA GCT TTC GCC TTT GCA GCT CTT GCC ATA GTG TTC TCC TCC 2544 

Gin Asp Ala Ala Phe Ala Phe Ala Ala Leu Ala lie Val Phe Ser Ser 
835 840 845 

20 TAC ATC ACT CTG GTC GTT CTG TTC GTG CCG AAG ATG CGC AGG TTG ATC 2592 
Tyr lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie 
850 855 860 

ACC CGG GGT GAG TGG CAG TCG GAG GCG CAG GAT ACC ATG AAA ACG GGG 2640 
25 Thr Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly 
865 870 875 880 

TCG TCG ACC AAC AAC AAT GAG GAA GAG AAG TCC CGA CTG TTG GAG AAG 2688 
Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys 
30 885 890 895 

GAG AAC CGG GAG CTG GAG AAG ATC ATT GCT GAG AAA GAG GAG CGA GTG 273 6 

Glu Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val 
900 905 910 

35 

TCC GAG CTG CGC CAT CAG CTT CGT TCT CGG CAG CAG CTG CGC CCT CGG 2784 
Ser Glu Leu Arg His Gin Leu Arg Ser Arg Gin Gin Leu Arg Pro Arg 
915 920 925 

40 CGT CAC CCC CCG ACG CCC CCA GAC CCC TCA GGG GGC CTG CCC AGG GGA 2832 L . 

Arg His Pro Pro Thr Pro Pro Asp Pro Ser Gly Gly Leu Pro Arg Gly 
930 935 940 
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CCC CAT GAG CCC CCT GAC CGG CTC AGC TGT GAC GGG AGC CGG GTT CAC 28^0 

Pro His Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His 
945 950 955 960 

TTG CTG TAC AAG TGA 2895 
Leu Leu Tyr Lys * 
965 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 964 amino acids 
{ B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

20 

Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
! 5 10 15 

Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
25 20 25 30 

Cys Gin He lie His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly 
35 40 45 

30 Leu Thr Arg Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr 
50 55 60 

Glu He Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 



Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

Ser Arg Cys Val Arg He Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 HO 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
115 120 125 
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Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 

5 Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly 
165 170 175 

10 

Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He 
15 195 200 205 

Leu Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp 
210 215 220 

20 Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
225 230 235 240 

He Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
245 250 255 

25 

Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser 
260 265 270 

Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
30 275 280 285 

Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
290 295 300 

35 Glu Lys Trp Gly Trp Arg Lys He Ala Thr He Gin Gin Thr Thr Glu rf 
305 310 315 320 

Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala 
325 330 335 

40 

Gly He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 

340 345 350 * 
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Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie lie Vai Gly Leu 
355 360 365 

Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu 
5 370 375 380 

Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala 
385 390 395 400 

10 Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser He Asn Cys Thr Val Asp 

405 410 415 

Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met 
420 425 430 

15 

Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu 
435 440 445 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
20 450 455 460 

Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu 
465 470 475 480 

25 Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 

485 490 495 

Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He 
500 505 510 

30 

Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin 
35 530 535 540 

Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

40 Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ala Pro 

565 570 575 

Pro Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Met Ser Gin 
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580 

Lys Leu Phe lie Ser Val Ser Val 
595 600 

5 

Ala Val Val Cys Leu Ser Phe Asn 
610 615 



585 '590 

Leu Ser Ser Leu Gly lie Val Leu 
605 

lie Tyr Asn Ser His Val Arg Tyr 
620 



He Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys 
*° 625 630' 635 640 

Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His 
645 650 655 

15 He Gly Arg Ser Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu 
660 665 670 

Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys He 
675 680 685 

20 

Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp 
690 695 700 

Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Thr Thr Val Gly Leu Leu 
25 705 710 715 720 

Val Gly Met Asp Val Leu Thr Leu Ala He Trp Gin Met Val Asp Pro 
725 730 735 

30 Leu His Arg Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp 
740 745 750 

He Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys 
755 760 765 

35 

Met Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu 
770 775 780 

Leu Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu 
40 785 790 795 800 

Lys He Asn Asp His Arg Ala Val Gly Met Ala Met Tyr Asn Val Ala 
805 810 815 
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Val Leu Cys Leu 
820 

Gin Asp Ala Ala 
835 

Tyr He Thr Leu 
850 

Thr Arg Gly Glu 
865 

Ser Ser Thr Asn 



Glu Asn Arg Glu 
900 

Ser Glu Leu Arg 
915 

Arg His Pro Pro 
930 

Pro His Glu Pro 
945 

Leu Leu Tyr Lys 



He Thr Ala Pro 



Phe Ala Phe Ala 
840 

Val Val Leu Phe 
855 

Trp Gin Ser Glu 
870 

Asn Asn Glu Glu 
885 

Leu Glu Lys He 



His Gin Leu Arg 
920 

Thr Pro Pro Asp 
935 

Pro Asp Arg Leu 
950 



Val Thr Met He 
825 

Ala Leu Ala He 



Val Pro Lys Met 
860 

Ala Gin Asp Thr 
875 

Glu Lys Ser Arg 
890 

He Ala Glu Lys 
905 

Ser Arg Gin Gin 



Pro Ser Gly Gly 
940 

Ser Cys Asp Gly 
955 



Leu Ser Ser Gin 
830 

Val Phe Ser Ser 
845 

Arg Arg Leu He 



Met Lys Thr Gly 
880 

Leu Leu Glu Lys 
895 

Glu Glu Arg Val 
910 

Leu Arg Pro Arg 
925 

Leu Pro Arg Gly 



Ser Arg Val His 
960 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO — 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 1 . .1737 

10 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

\ 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

20 ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
25 Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
30 6 5 7 0 7 5 8 0 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

35 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 336 
Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

40 TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 3 84., 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 
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TTC CGG TGT GAC CCC GAC TTC CAT CTG GTG GGC AGC TCC CGG AGC ATC 4T2 
Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser lie 
130 135 140 

5 TGT AGT CAG GGC CAG TGG AGC ACC CCC AAG CCC CAC TGC CAG GTG AAT 480 
Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

CGA ScG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT 528 
10 Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe 

165 170 175 

CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG 576 
Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
15 180 185 190 

GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC 624 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp 
195 200 205 

20 

TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA 672 

Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 

25 GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC 720 
Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 

ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT 768 
30 He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 

245 250 255 

GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA 816 
Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 



35 



40 



GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA 864 
Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
275 ^ 280 285 

TCA GCC AC A CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG 912 
Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 
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GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT 960 
Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG 1008 
Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu 
325 330 335 

ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA 105 6 

lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG 1104 
Asn Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu 
355 360 365 

ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT 1152 
Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG 12 00 

Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
385 390 395 400 

TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT 1248 
Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT 1296 
Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG 1344 
Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu" 
435 440 445 

AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC 1392 
Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
450 455 460 

CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA* CTG GCC 1440 
Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala 
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465 



470 475 480 



CTG AAC AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG 1488 
Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu 
485 490 495 

GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA 153 6 

Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala 
500 505 510 

ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT 1584 
Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp 
515 520 525 

GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT 1632 
Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly 
530 535 540 

GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT 1680 
Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

TCC TGG TCC AAA ACA GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC 1728 
Ser Trp Ser Lys Thr Asp Lys Trp lie Val He Ser Arg Thr His Ser 
565 570 575 



CCA ACC TGA 
Pro Thr * 



1737 



(2) INFORMATION FOR SEQ ID NO : 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 8 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
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1 5 10 15 ~ 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 

5 

He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu o 
10 50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

15 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

20 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser He 
25 130 135 140 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

30 Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe 

165 170 175 

Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

35 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp 
195 200 205 

Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp Pro Gly Gin 
40 210 215 220 

Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He 
225 230 235 240 
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He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 

Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 

Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
^ 275 280 285 

Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 

Gly Trp Lys Lys He Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu 
325 330 335 

He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
355 360 365 

Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
385 390 395 400 

Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
450 455 460 
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Gin Glu Ala Pro Leu 
465 

Leu Asn Lys Thr Ser 

5 485 

Asp Phe Asn Tyr Asn 
500 

!0 Met Asn Ser Ser Ser 
515 

Ala Ser Gly Ser Arg 
530 

15 

Gly Ser Tyr Lys Lys 
545 

Ser Trp Ser Lys Thr 
20 565 

Pro Thr 



Ala Tyr Asp Ala He 
470 

Gly Gly Gly Gly Arg 
490 

Asn Gin Thr He Thr 
505 

Phe Glu Gly Val Ser 
520 

Met Ala Trp Thr Leu 
535 

He Gly Tyr Tyr Asp 
550 

Asp Lys Trp He Val 
570 



Trp Ala Leu Ala Leu Ala 
475 480 

Ser Gly Val Arg Leu Glu 
495 

Asp Gin He Tyr Arg Ala 
510 

Gly His Val Val Phe Asp 
525 

He Glu Gin Leu Gin Gly 
540 

Ser Thr Lys Asp Asp Leu 
555 560 

He Ser Arg Thr His Ser 
575 



25 



(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

35 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Homo sapiens 

(ix) FEATURE: 
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(A) NAME/ KEY : CDS 

(B) LOCATION : 1 . .13 86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG TCA* AAC CGG 
Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 



48 



96 



ATG GGG CCC GGG GCC CCT TTT GCC CGG GTG GGG TGG CCA CTG CCG CTT 
Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
-L 5 10 15 

CTG GTT GTG ATG GCG GCA GGG GTG GCT CCG GTG TGG GCC TCC CAC TCC 
Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

CCC CAT CTC CCG CGG CCT CAC TCG CGG GTC CCC CCG CAC CCC TCC TCA 144 
Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 240 
Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 288 
Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 3 36 

He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 HO 

TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT, ATG CCT GGC 3 84 

Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He Leu Met Pro Gly 
115 120 125 



432 



480 
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145 150 155 160 ~ 

CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC 528 
Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 

AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 576 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

15 CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 672 
Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 720 
20 Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 768 
Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
25 245 250 255 

GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 816 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

30 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 864 

Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp 
275 280 285 

35 CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC* 912 
Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 960 
40 His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA 1008 
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lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys' Arg 
325 330 335 

CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 1056 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 1104 
Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 1152 
Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

•AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC 1200 
Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 1248 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 1296 
Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 13 44 

lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC CCA ACC TGA 13 86 

Asp' Lys Trp lie Val He Ser Arg Thr His Ser Pro Thr * 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: protein — 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
5 1 5 10 15 

Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

10 Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

15 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

Asp Val Asn Ser Arg. Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
20 85 90 95 

He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

25 Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He Leu Met Pro Gly 
115 120 125 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

30 

Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
35 165 170 175 

Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys He 
180 185 190 

40 Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He Thr Phe Arg Gin 
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210 215 220 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 
275 280 285 

Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 .310 315 320 

He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 
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Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canis familiaris 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION : 1 . .1746 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ATG CTG CTG CTG CTC CTG CCG CTG GCG CTG GCG CCG CTC TTC CTC CGC 48 

Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 

1 5 10 15 

CCC CCG GGC GCG GGC GGG GCA CAG ACC CCC AAC GCC ACC TCG GAA GGT 96 

Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 

20 25 30 

TGC CAG ATC ATA CAC CCG CCT TGG GAA GGG GGT ATC AGG TAC AGG GGC 144 

Cys Gin lie lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly 

35 40 45 

CTG ACT CGT GAC CAG GTG AAG GCT ATC AAC TTC CTG CCG GTG GAC TAT 192 

Leu Thr Arg Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val *Asp Tyr 
50 55 60 
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GAG ATT GAG TAT GTG TGC CGG GGA GAG CGA GAG GTG GTG GGG CCC AAG 240 
Glu lie Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

GTC CGA AAG TGC CTG GCC AAT GGC TCC TGG ACA GAT ATG GAC ACA CCC 288 
Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

AGC CGC TGT GTC CGA ATC TGT TCC AAG TCA TAT TTG GCC CTG GAA AAT 336 
Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

GGG AAG GTC TTC CTG ACG GGT GGG GAC CTC CCC GCT CTG GAT GGA GCC 3 84 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
115 120 125 

CGG GTG GAT TTC CGG TGT GAC CCT GAC TTC CAT CTT GTG GGC AGC TCC 432 
Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 

CGG AGT ATC TGT AGT CAG GGC CAG TGG AGC ACT CCC AAG CCC CAC TGC 480 
Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

CAG GTG AGC CGA ACG CCG CAC TCA GAG CGG CGA GCG GTG TAC ATC GGG 528 
Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly 
165 170 175 

GCG CTG TTT CCC ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG 576 
Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC 624 
Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp Iler 
195 200 205 

CTG CCG GAC TAC GAG CTC AAG CTC ATC CAC CAC. GAC AGC AAG TGT GAC 672 
Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp 
210 215 220 

CCA GGC CAA GCT ACC AAG TAC CTG TAT GAA CTG CTC TAC AAC GAC CCC 72 0 

Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
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225 230 235 240 



ATC AAG ATC ATC CTC ATG CCT GGC TGC AGC TCT GTC TCC ACG CTT GTG 768 
lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
5 245 250 255 



GCT GAG GCT GCC AGG ATG TGG AAC CTC ATT GTG CTC TCC TAT GGT TCC 816 

Ala Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser 
260 265 270 

10 

AGC TCA CCA GCT CTG TCC AAC CGG CAG CGC TTT CCT ACC TTC TTC CGA 864 

Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
275 280 285 



15 ACT CAT CCC TCG GCC ACG CTC CAC AAC CCT ACG CGA GTG AAG CTC TTT 912 
Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
290 295 300 



GAG AAG TGG GGC TGG .AGG AAG ATT GCC ACC ATC CAG CAG ACC ACC GAG 960 
20 Glu Lys Trp Gly Trp Arg Lys He Ala Thr He Gin Gin Thr Thr Glu 
305 310 315 320 



GTG TTC ACA TCG ACT CTG GAC GAC CTA GAG GAA CGA GTG AAG GAG GCT 1008 
Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala 
25 325 330 335 



GGG ATT GAG ATT ACT TTC CGC CAG AGC TTC TTC TCA GAT CCT GCC GTG 1056 
Gly lie Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 
340 345 350 

30 

CCT GTC AAG AAC CTC AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT 1104 
Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu 
355 360 365 



35 TTC TAT GAG ACT GAA GCC 
Phe Tyr Glu Thr Glu Ala 
370 

CGG CTC TTT GGG AAG AAG 
40 Arg Leu Phe Gly Lys Lys 
385 390 

GAC AAT TGG TTC AAG ACC 



CGG AAA GTG TTC TGT GAG 

Arg Lys Val Phe Cys Glu 

375 380 

TAT GTG TGG TTC CTC ATT 

Tyr Val Trp Phe Leu He 
395 

TAC GAC CCC TCC ATC AAC 



GTA TAC AAG GAG - 1152 
Val Tyr Lys Glu 

GGG TGG TAT GCT 1200 
Gly Trp Tyr Ala 
400 

TGC ACA GTG GAT 1248 
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Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser He Asn Cys Thr Val Asp ~ 
405 410 415 

GAG ATG ACC GAG GCT GTG GAA GGC CAC ATC ACC ACT GAG ATT GTC ATG 1296 
Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met 
420 425 430 

CTG AAC CCA GCC AAC ACC CGC AGC ATC TCC AAC ATG ACA TCC CAG GAG 1344 
Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu 
435 440 445 

TTT GTG GAG AAA CTG ACC AAG AGA CTC AAG AGA CAC CCT GAG GAG ACA 13 92 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
450 455 460 

GGC GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG 1440 
Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu 
465 470 475 480 

GCA TTG GCC CTG AAC AAG ACA TCT GGA GGG AGC GGC CGT TCG GGG GTG 1488 
Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 
485 490 495 

CGC CTG GAA GAC TTC AAC TAC AAC AAC CAG ACG ATC ACA GAC CAA ATC 1536 
Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He 
500 505 510 

TAC CGC GCA ATG AAC TCC TCG TCC TTT GAG GGT GTC TCT GGC CAC GTG 1584 
Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

GTG TTT GAT GCC AGC GGC TCA CGG ATG GCC TGG ACT CTG ATT GAG CAG 163 2 

Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin 
530 535 540 

CTG CAG GGT GGC AGC TAC AAG AAG ATC GGC TAC TAT GAC AGC ACC AAG 1680 
Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

GAT GAC CTT TCC TGG TCT AAA ACG GAC AAA TGG ATT GTT ACA TCC AGA 172 8 . 

Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Val Thr Ser Arg 
565 570 *575 
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ACT CCC AGC CCA ACT TGA 17^ 
Thr Pro Ser Pro Thr * 
580 

5 

(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 581 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
1 5 10 -15 

20 

Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
20 25 30 

Cys Gin lie lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly 
25 35 40 45 

Leu Thr Arg Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr 
50 55 60 

30 Glu lie Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

35 

Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
40 115 120 125 

Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 

HI 865-1 WO SEQ 85 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



Arg Ser He Cys 
145 

Gin Val Ser Arg 



Ser Gin Gly Gin 
150 

Thr Pro His Ser 
165 



Trp Ser Thr Pro 
155 

Glu Arg Arg Ala 
170 



Lys Pro His Cys 
160 

Val Tyr He Gly 
175 



Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
" 180 185 190 

Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He 
195 200 205 

Leu Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp 
210 215 220 

Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
225 230 235 240 

He Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
245 250 255 

Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser 
260 265 270 

Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
275 280 285 

Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
290 295 300 

Glu Lys Trp Gly Trp Arg Lys He Ala Thr He Gin Gin Thr Thr Glu 
305 310 315 320 

Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala* 
325 330 335 

Gly He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 
340 345 350 

Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu 
355 360 365 
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Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu 
370 375 380 

Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala 
5 385 390 395 400 

Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Asp 
405 410 415 

10 Glu Met Thr Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met 
420 425 430 

Leu Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu 
435 440 445 

!5 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
450 455 460 

Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu 
20 465 470 475 480 

Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 
485 490 495 

25 Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He 
500 505 510 

Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

30 

Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin 
530 535 540 

Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys 
35 545 550 555 560" 

Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Val Thr Ser Arg 
565 570 575 

40 Thr Pro Ser Pro Thr 
580 
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((2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16862 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 3415 . .3440 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION : 3 441 . .3903 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 3904 . .3988 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 3 989. .4689 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 4690. .4893 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4894. .5645 

(ix) FEATURE: 

(A) NAME /KEY: exon 
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(B) LOCATION: 5646. .5831 

<ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 5832 . .7181 

(ix) FEATURE: 

(A) NAME/ KEY : exon 

(B) LOCATION : 7 182 . .7202 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 7203 . .8307 



15 (ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 8308 . . 8803 

(ix) FEATURE: 
20 (A) NAME / KEY : intron 

(B) LOCATION: 8 804. .12266 

(ix) FEATURE: 

(A) NAME/ KEY : exon 
25 (B) LOCATION: 12267. .12401 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 12402 . .12815 

30 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 12816. .12986 

35 (ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 12987 . .14085 

(ix) FEATURE: 
40 (A) NAME/KEY: exon 

(B) LOCATION: 14086 . .14187 

(ix) FEATURE: 
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(A) NAME /KEY: intron 

(B) LOCATION: 14188 . .14473 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 14474. .14539 



(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 14540. .14998 



(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 14999. .15190 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 15191. .16862 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GATCATATTA ATTTGAAGGT GGCGGGGCAG GATGGTTCTG TGGTGCAGTT TAAGATTAAG 
AGGCATACAC CACTTAGTAA ACTAATGAAA GCCTATTGTG AACGACAGGG ATTGTCAATG 
AGGCAGATCA GATTCCGATT CGACGGGCAA CCAATGAAAC AGACACACCT GCACAGTTGG 
AAATGGAGGA TGAAGATACA ATTGATGTGT TCCAACAGCA GACGGGAGGT GTCTACTGAA 
AAGGGAACCT GCTTCTTTAC TCCAGAACTC TGTTCTTTAA AGACCAAGAT TACATTCTCA 
ATTAGAAAAC TGCAATTTGC TTCCACCACA TC CTG ACT AC TACCGTATAG TTTTCTCTAT 
TCTTTCATTT CCCCCTTCCC CATTCCTTTA CTGTACATAA AGTAACTGGT ATATGTGCAC 
AAGCATATTA CTTTTTTTTT TTAAAACTAA ACAGCCAATG GTATGTTTTG ATTGACATCA 
AGTGGAGACG GGGGGGAAAA TACTGATTCT GTGAAAATAC CCCCTTTCTC CATTAGTGGC 
ATGCTCATTC AGCTCTTATC TTTATATTCC AGTAAGTTAT TTTGCTCTCA CTGTTTTAAC 
AACAACAACA AAAAAACAAC AACATAAAAA TCCTTGCATA CCTTGTTCAA TTGGAGAATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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TTAATGTTTT TCATTTATCA TTGTAAAACC AAGGACAATT TTATAACTTT TTTGTACTTA 720 
GCTGTTACAT GCAGAGCAAT CTGTCTTTAA GTAGGGATAA ATTACTCTAA AACAAAAAAG 780 

5 

AATCCTAGAT AGTTTTCCCT TCAAGTCAAG CGTCTTGTTG TTTAAATAAA CTTCTTGTTT 840 
AAAAAAAAAA AAAGTAAAAA AGAAAAGTTA TGCAACAATT AATGGCCCAG AGGCAATCCT 900 
10 TGTTAACATT TTGATGCATC TTTTAGCTGT TTTTTTTTTT TTTTTTTTTT TTGACTGAGT 960 

TTGACTCTTG TCACCCAGGC TGAAGTGCAA TGGCATGGCA TGATCTTGGC TCACTGCAAC 1020 

CTCCGCCTCC CGGGTTCAAG TGATTCTCCT GCCTCAGCCT CCTGAGTAGC TAGGATTACG 1080 

15 

GGCATGCACC ACCATGCCTG GCTAATTTTG TATTTTTAGT AGAGTTGGGG CTTCTCCACA 1140 

CTGGTCAGGC TGGTCTCGAA CTCCCAACCT CAGGTGATAA GGGAAGGGGC ACTATTGACA 1200 

20 TTTATGGTTG GGGCAGAGGT GTAAGATATT CTTCAAAGCA CTACCTACAT GTTGAAGAAT 1260 

TGTTCCTCAC CCAGATTCTC AAAAGTCCCC CAGGACATTC ACGTAGTGAA AACCTGTGTT 1320 

TAATTATCTG AGCCTATAAC TTAATACAGT TTTAAAATTT TTTTTTAAAT ATACAGTGAA 13 80 

25 

CTTTCTAGGA ATGCAATTAT AGTTGTGTGT AAAATTAGGG AAAATTAACT TTGCTACCAA 1440 

GAGTTGTTCA ACATTTTGTT AAATCACTTC ATTGATGGCA ACATGCTGGA GGTAGTTGAG 1500 

30 TCACCAACTC AGCACCTGGA TCAGCCTGTG TTGGTAGCAG TTTCATCCCC GTGGTTCTGT 1560 

GAATAGGTGG AAGCATCTGC TTACTCCATC AGGACTTCTA GGGTAGTCGG GCCTTGGCAC 1620 

TCACACATTA AAATACTGTT TATGTTATTT TATTGCAAGT TACTTTTCTT TCATTTCCCC 1680 

35 

TTTACGTTAC AGAAAGGGAA GCATTTTGCT TTCTGTTTAA AGTTGTGTAT GTAGGTAGGT 1740 

TATATCATCT AWGACTTTCT CTCCCTCCTT CCCTTTCTTT TTGTTTGAGA TGGAGTCTTG 1800 

40 CTCTGTCACC CAGGCTGGAG TGCAGTGGTG CGATCTTGGC TCACTGCAAC CTCTGCCTCC 1860i. 

CGGGTTCAAG CGATTCTGGT GTCTCAGCTG GGATTACAGG CGCACACCAT CACACCACGC 1920 
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TAATTTTTCT ATTTTTAGTA GAGATGGGGT TTCGCCATGC TGGCCAGGCC AGGCTGGTCT 198XJ 

CAAACTCCTG AGCTCAAGTG ATCAGTCCGC CTCGGCCTCC CAAAGTTCTG GGATTTCAGG 2040 

CGTGAGCCTC ATCTATGAAT CTCAATTTAG GACAGTAAAA GTGTCATTAC AAAAATATTT 2100 

ATTGTAAAAA AGGGTTGGAG GTTGAGAATC TCAATTCTAG TCAGTCTCTC AGTGTTTGGT 2160 

TTCTTCCTAC CATTTTTCCC CCTAGGACCA GCCAGAAAGC AGCTTTTTTT TTGTCCCCCC 2220 

CAACAAGGAG CCCACTGTTT CCTCTCCCAG CCCAAACTCA GGCCTACGAA CAACAACAGC 2280 

ACTACACACA CACACACACA CACACACACA CACACACACA CACCCCTCCA CTTCAAGGTA 2340 

TAGCCAAGAG CTTCTGGAGC CGTCAAAAAG GTCTGTACCT GCTGTCTTTA GAGCTTCCAG 2400 

TTTGCCCTTG GTCAAGAAAT ACTGTTTGCT AGGCTCTGCT GGAGTACATC AGGTAATACT 2460 

GGCTTCTAAA CCACCCTGAG GTTCTTTTCT CTTGTCCTTT TACTCCCTTC GTACTTCAAT 2520 

TTCTCTCCTT GATGTCCCCC TCCCTGTTTT GTTTTTTGCC TCCAATCCGT TCTGCGCGTT 2580 

CCCTGCAGAG CAGGCGAGTA GCAATGCTGC TGGACCATGG AGCTGCTCTA GTCTCCCAGA 2640 

AATCTCTTCT ACACCCAACC CTTCTTGCGC TTAGGTGGTC CTCAGTCCCC CTCCCCCACC 2700 

TCCTTCTGAC CCAGGCTTCT TTCTCGCCCT CCGGTCGCAG TTCTCCTGGG CATCTGCCTC 2760 

TGCCTCTCTC CTCTCACCCG GATCTAGGGC TGCCTTCTCT TTGTGCAGCC GTCTTTCTCC 2820 

ACCTTCATCC CAGACTCCCT GTCTCAGCGC CAGCTCCTCT GCCTTTGGCT CGGGTTCCCT 2880 

CTCCCCCACC CCAGCTTCCA GTTGTTTGGC CCGCAGGTCC CTCGGCAGTG ACCGGCGCCC 2940 

CCCGACGAGT GCGTGTGCAC CAGGGCACCT CCCTCTCCCC CACCTCTCAG CCCCGCGCGT 3000 

CTCCACCGCC CGCCCCACCG CGCTGTGGGC GGTCCAGGGC GGGGCTGGGA TCCGGGGCGG 3060 

CTCCCGGGGC TCGGGTTGTG GGAGGCGCCC TCTCCCCGGT CTTCCCCTCT CTTCCCCCCG 3120 

CCCTGCCTTC CCTTGCACCC TCCTTCTTCC CTCCGCCCGG GAGCTCTCCC TGGTCCCCGG 3180 

CGCCGCCTCC TTCCCTCCCG GCTCCCCGCT CCCCGCTCCC GTGGCTGCCG CCGCCCCGGG 3240 
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GAAGAAGAGA CAGGGGTGGG 

GGCTGGAGCC TGGATTCGAG 

5 

GAGGGGGGAG CGGGGAGGAG 

AGCCGGGCCC GCGCGCCGAG 

10 GGCGCGCGGC GTGGTGGGGG 

CTGGGGGAGA GGAGGAACCG 

GAGCGTGGGT AAGCCAGGCT 

15 

GGTGGAGTCC GGGAGGGTTG 

TCTCCTCCTC TCCCCCCACC 

20 TCTTGGTCCC CAGGCTGAGC 

GTCTCCGGGT GCGGGCCGCG 

TAGATGTTGC TGCTGCTGCT 

25 

GCGCAGACCC CCAACGCCAC 

TTCGCTCCAC TTCCCTTTCC 

30 GTCCCACATA TCATCC CGGT 

ACCTCCTTCT GTCACCCCAC 

ACTATACCAC CTTCCTGTGC 

35 

TACCTCAATT ACTCCCTTTT 

CCTTTATCCT CCTATCCTAA 

40 AAAGCTCCAC CCACATATCA 

CACCCAGCAC CCTTCCCTCC 



PCT/SE98/01947 

GTTTGGGGGA AGCGAGAGAG GAGGGGAGAG ACCCTGGCCA 3300 

GGGAGGAGGG ACGGGAGGAG GAGAAAGGTG GAGGAGAAGG 3360 

CGGCCGGGCC TGGGGCCTTG AGGCCCGGGG AGAGCCGGGG 3420 

GTAAGAGCCA GGGCCCCGGG TTAGCAGGGC TCGGAGAGGG 3480 

AGGGGGCAGT GGGCGCAGGG CCCAGCTGGG GGAAGCGGGG 3540 

CGGGGATGGA ATCGGGGAGC GCTGAGGCGG CCGATGCCGG 3600 

TCTGCGAGCC GCGGGGGCCG GGGGAGAGGA GGTGGTGAGA 3660 

GGGGCCGAGG GAGGCAGGAG GAGGGTGGGG ACAGGCTTTC 3720 

CCGCGCGGGG CTCCGCCCCC GCCTCCTCCG CGGGGCGCTC 3780 

CCGGTCGGAG CCTGCGAGGC AACCGGCAAG AGGTCGAGTA 3 840 

CCGGCGGGGC TCGGTCCAGT CCTCATGGCC GCCTCTCACT 3900 

ACTGGCGCCA CTCTTCCTCC GCCCCCCGGG CGCGGGCGGG 3960 

CTCAGAAGGT GCATCCTTCT TCGACGACCT CCGGCCCTCC 4020 

CTGCATCTCC TCATTTCTGG TCCTCATCAC TATCCCATCA 4080 

CTGGCAACCC CTTCTGCTCG GCCCGACTTT ACTACTGCTG 4140 

GTTACTATCC AGCACCTCTT TTCTCTGCCC ACATTGCTAC 4200 

ATTTTCTCCG CCTCAATCCC CTTTCCCAGC CCCACATTAC 4260 

CTTGGTCCCA CTTTGCTGTC CAGATGATCT TATTAGCCTC 4320 

TTCAACTGGA ATATCCTCAT TTAGCCTTTT TTTTTAAAGA 4380 

TACCCTTCAT GATTTCTTAA TTACTTTTCT TTCTTACCTC 4440 » 

CCACTTGTGG GTTCTCTCAT CAGCTTTAAC CCTGGCCCTT 4500 
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TACTCTCTGT CCTTTAGCCA GGGGATCTGT ACCTGTCCCC ACTCCCACCC TCTAGTGC'CC 456"0 

CATCCCTCTT CCTCTGTCCC CAGCCTGCCC ACAGACCACG CCCTACTCTC CCCTTCCTCC 4620 

CACTGGGGAG CCTGCCTTTT CCTCTTTCCC ACCATTCCTC TCTGTATGCC TCCCCGACTC 4680 

ACCCCTTAGG TTGCCAGATC ATACACCCGC CCTGGGAAGG GGGCATCAGG TACCGGGGCC 4740 

TGACTCGGGA CCAGGTGAAG GCTATCAACT TCCTGCCAGT GGACTATGAG ATTGAGTATG 4800 

TGTGCCGGGG GGAGCGCGAG GTGGTGGGGC CCAAGGTCCG CAAGTGCCTG GCCAACGGCT 4860 

CCTGGACAGA TATGGACACA CCCAGCCGCT GTGGTGAGTA GCCTCGGAAG CCCCTCCCCT 4920 

CTTCAAGACT ATTCCTTTTC CTGCCGCAAA CTTAGCATTA CTGCTTGCAA GTCAGCACTT 4980 

TAAATCCAGT ATACCAAAAT TCACAAATAC ATTTATTGAA TGACTACTAC ATAAGAGCAA 5040 

TTTTGCTCTG TGCGGTTGGA GGTAGTAGAG CTAGCAGCCT GCACAGTTCA . TTTCATCCTC 5100 

CCTTCATTAG GCCACTGATC ATTGGC CT AT AACATTGATA ATTCATCTTG TCAGTTATTC 5160 

TCTTTGAGGA TCATTAGTGG CAGATGATGA CAAAAAAATT CTAAAATGAT TTCATCACAT 5220 

TTTTGAATAC CTCTGTCACC AACCCAGAGA CCATATGCCC AAGAAACAAA AGCCAGTTTA 5280 

ATATTAATAG AAGCCAACTA TAATAAGAAA AGCAAATCTG ATTGTGCATC CAAAGTTATA 5340 

TACATCTACA TATTTCAAAG CCAGAGAACC GCCCACTGTA GCTGACTTTG AAGAGATCCC 5400 

ATTTTGTGTG CTTATAGCCC CATCTTGGGT TCCTAAAATG GTAATTTTTT TTTTCTTTTG 5460 

GGAATGTGTG GATGCTTGCA CAGGTAAGGG AGGATTGGAA GATAGGTAGG CAAATCCTTT 5520 

TCACATGTGA TTTTCTTTAG AGCAGGATGC TTGTGGACCC AAACCTGCAC CTGAGTCCC€ 5580 

TGCTCTTTAA AGGGAAAGAG CCTTCTTCAA CTCGCCTCTC TTCTTATTTT CCTATCTCTC 5640 

CACAGTCCGA ATCTGCTCCA AGTCTTATTT GACCCTGGAA AATGGGAAGG TTTTCCTGAC 5700 
GGGTGGGGAC CTCCCAGCTC TGGACGGAGC CCGGGTGGAT TTCCGGTGTG ACCCCGACTT 5760 
CCATCTGGTG GGCAGCTCCC GGAGCATCTG TAGTCAGGGC CAGTGGAGCA CCCCCAAGCC 5820 
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CCACTGCCAG GGTGAGGGGA ACAGCTGCCT GCATGCAGCT GATGAGGACG CTTGTGTGAG 5880 

GATGGGAGTG GGGTGGGAAT GGATAATGGG AAAGAATGGA GAGCTATAAA AATGTGGGGG 5940 

5 

AGGACACTGG AAAGGGGAGA TGAAAGTCCC TTTTTCCTCC ATCACCTGCC TCAAACTTCC 6000 

TCTTGCAGTC CCCGGTATCC TCTGTAGGTT GGGGGCTTCC TTCCTTTACC TTTTAAAAAA 6060 

10 ATCTTCCTGC TCC CGATTCT TAGACCTCAC GTTTTCTCTT TTCCTTTATG AATCTCACCT 6120 

CTCTCACCTT CTTCAGGTTT AAATACTCCA ATTTTCCCTT TCTCTAAACT TAGAAATTTC 6180 

CATGCATCAC CCTCTTCTAG AATTCATCCC TCACCATTCC TTATATAATT GATTTATTGT 6240 

15 

AAAGACTCAG AAATAAATCA AACATTCTAC TAAGAAAAAT TGAGAAGGGG AGCTCTGGGG 6300 

GTGGAAACAT ATTAGGGTAA AAGACTTAAA ATTGGAGGCA GCATTATCAG AAGATGAAGA 6360 

20 ACAACTCAGG GATGGGGTGG GAAGAAGACA GGTCCTTTTC TGKACTTCCT AGACAACCTC 6420 

CATTATTCCC TAAGGGAATC AGTGTTGTGT CTGTCTACYT TTTTTTTTTT TTTTTTGCCA 6480 

CGTAATTTTA CAAACTCTCC CTTTTCTAGG CACCCGAACT CTCTGCCATC TTCTCTCCTG 6540 

25 

GGATGCAGTC ATCCCATTTG TATGCCTCAT ACTTCCTCTA CCCTGGTAGA TTCTTTCAAG 6600 

ATCCTTGGGC TTTACTTTCC TCACATAACT CAGTTATTCT GCTTCTAGTT TACCATTTTA 6660 

30 TTCTGGAAAT TGAGAGTCCC ATCCAGGGGT GGACTTATGA CACTACTGAA ACTTAGACTT 6720 

CAAGGTTCCT CACCTACAGG GCCCTCTTCC TGTGCTCTAA TAATATAGAG GGCTCGATGG 6780 

ATATGTGTTC ATATGGTAAC AGGCTTTTGT AAAAATTGCA GAAATAAGAT TTTAACAGCA 6840 

35 

ATTGCTTAAA GCCAATTGTA TGTGTAATTT TTTTTCTTAA AGACTCCCAA TTTTGTAATA 6900 

TTCAGGCACC ACAGAACCAA GATCTGCCCC AAACTTAGCT ATTGGCATTC CCGTCTCAAA 6960 

40 TTCTGTTGTC CTATGAAAAA TCGAAGAAGA AAATAAGTC C TGACCCCCTT ACCCCCAGAC 7020 ^ 

CCACCTTGTT CTTATCCCCA GGCACCCTCC CCTCAGAAAC GCAGGCTTCT GCTCTCCCCG 7080 
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GTCTTCAGCA TGGACAGGTG TGGGAGGGGG CTGGGGATCA GGCCAGGGAA GCTGGGCGCC 71*0 

AGTGGTAACT CTTCTCTGAT CCCCGTCTTT CCTGCTGCCA GTGAATCGAA CGCCACACTC 7200 

AGGTGAGATG AGAAACCCTT ACCGCGCGCA CTGCAATGCC CTCCCCTTCA CTCTGCACCC 7260 

TCCACCCCCC TGAAATTCTG CCCTTAGGCT ACGGGGCGTC GTCCTTTCGC ACCTTCCCCA 7320 

ACCCACCCCA GTTTGCGGCC ACCCCCTTCC CTCCCTACCT GTTTCCTGCC TCCAGTCCCG 7380 

GTTTTCCACG AGGCTGCGGT CTCTCCTTGT CCCTGCTTGG CTACACTTCC CTGGGCTCCA 7440 

CCTCCTCCCA GACTGAGCCT CGCCGGTGTC AGGCAGAGCC CAGCAGARGG CGGCAGGGTG 7500 

CTGGGAGACC CTGAGCTCCC ACCACGTTTT CCCCTGTGGG GTTCCTTGCG AC CTTCGCTG 7560 

GAACCTTTTC CAGCCTGCTG CCTCCTAGGA TTTCACCTAA TGGACTTTCT CAGCCTGTCC 7620 

CACCCATCCC AACCCTGGCC AGGCCTCTCG CGCTCTTCCC CACATCTTTT CCTTCCGTGT 7680 

ACCCCTTCCC TCGTCTTTTC TCAATTCCAT GTCCTGTCTC CCTTTCTTAG GCTTCTGTCT 7740 

ACCCAGCCCC AGGCTCCCTT CCACGACCCC ACCACTCCCT CAAACCAGCC TCCCTTCCGT 7800 

ACCCAACTCG TTCCCTCCAA AACCGTTTCC TCTCCCCCAC ATCCTCAGTG CTTCACTGTA 7860 

TCGACTCATA CTCCCACTTC AGACCTCAGG CGCCAGCCCC GTTTCTCTCC CGTCCCACTC 7920 

GCATCCTTCC CTTCCTACCC TGGTTCCTCC GTGCTTCAGC CTCCCGCGGC TCCCTCCGCC 7980 
CACCCCGCCC TCCTGGCACG CCCCGTCCCC ATTTCTCCTC CCCTCGGGTC CCCTTAAGTG 8040 
AGATCCCTCC CTTCCTCTTT CGTTCCTTTC CTCCTCGAGG TTGCATCCCC CCTCCCCTCC 8100 
CCGCCCCTCC GACTGTCGCT CCCACCTCGG CGCTCGCTTC CCTCCCCGCC CCCTTCCTGC 8160 
CTCCCCAGCT CCCGCCCGCC CCCCCACCCC CCGCTGCCGC GCGCCGCCCG TGACGTCAGA 8220 
GCCCCCTCCC AGCCCCACAT CTCCCTCCTG CTCCTCCTCC TCCCCTCCGT CGGTCAGTCA 8280 

I.- 

GTCCGCGAGG AGAGTCCGCG GTGGCGGCGA CGGTGGCGAG AGCCGCGGGG GCCGTAGGAA 8340 
GCCAACCTTC CCTGCTTCTC CGGGGCCCTC GCCCCCTCCT CCCCACAAAA TCAGGGATGG 8400 
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AGGCGCCTCC CCGGCACCCT CTTAGCAGCC CTCCCCGGGA AAAGTGTCCC CCCTGAGCTC 8460 

CTAACGCTCC CCAACAGCTA CCCCTGCCCC CCACGCCATG GGGCCCGGGG CCCCTTTTGC 8520 

5 

CCGGGTGGGG TGGCCACTGC CGCTTCTGGT TGTGATGGCG GCAGGGGTGG CTCCGGTGTG 8580 

GGCCTCCCAC TCCCCCCATC TCCCGCGGCC TCACTCGCGG GTCCCCCCGC ACCCCTCCTC 8640 

10 AGAACGGCGC GCAGTGTACA TCGGGGCACT GTTTCCCATG AGCGGGGGCT GGCCAGGGGG 8700 
CCAGGCCTGC CAGCCCGCGG TGGAGATGGC GCTGGAGGAC GTGAATAGCC GCAGGGACAT " 8760 

CCTGCCGGAC TATGAGCTCA AGCTCATCCA CCACGACAGC AAGGTAGCCC TGGACATGGG 8820 

15 

GGTGGGTGGG AGGTGGGGGC TTGCGGGSCA GGGGGCCAAG CAAGCCTGCA CGCGCCCCCA 8880 

TCTGTCTGAG TCGTCTCTGG GATTGCGAGG CAGACCCCTC CCTTGTGTGA CTGGCAGGAG 8940 

20 ATGGGCTGGG GGTGCAGGAG CTTGGGAAGA GTCGCAGGGG CTGGAGGTCC AAGATGAGGG 9000 

TCTAGGGGCT CAAGATGGTT AAGCATGCTG CAAGGCAGAC CCTTCTGCCC CGCTGCGGGA 9060 

GTCTCGCAGA AGTGTCGGGG TTTGGAGAAA CTGGTGGTGG ATTTAAGGTA TTAGGAGACA 9120 

25 

CTGATCCTCT GAGGGAGTAA ACTAACCCTG GAATGGGTTG GGGGTGGAGG GAATGTCAGA 9180 

GGTGGGGAGC TGGATTGGGG GGTTACATTT ACCATGGTAA CAAGGTAAAA TCTTGGCGTA 9240 

30 GGTTGGAGCT GGAAGGAATA GGGACAGAAT GAGGAAAATT TTGAGAGACT TGAGAGCTCT 9300 

AGTTTATTTA TCTTAACAAA ACAGCAAGGT AGTGGTGAGC CCTACCTGAC TCCTTCTCAT 9360 

CCTTCTATTC CCAACCCTGT TGAGCATTCC CAGACTGTGG GATAGATGGC ATATGGTGAT 9420 

35 

TGGGGAAGGC TAATGATCAA GAGGTGGGCA GAGGCACTGG GAAAATGAAT TGGATTGGGG 9480 

ATCCACATGG GAACCCCCAC AATAGCATGG GGATGAAGAA GAGTCAACAT ACAAGGAGAA 9540 

40 GAGAACAGAA AAGAATGGCA GTGGGGGAGA GGGGC AAGGA GGTAGCGTGG GGATAATGAG 960 0 U 

AGATCTTGGG GCACCTTATG GAACTTGGGT CCTGACCTTC CCTTCCCTTA TAGCATTGTG 9660 
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GCCTCTAGGA TGTGAGAAGG GAAATGGGAT GTAGGGATTA GGGAGGTG AG TTGAGGGAGA 9T20 

GAGAGAAGGT AAGCAAATTT GGGTCCAGGG GTATTAGGGG ATAGCTTATA ATGAGGTTTT 9780 

TTTTCCCACC CCTCTCCCCT ACATGAATAA TTGGGGGTGC AGGGAAGGAT GTGACACAGG 9840 

GAAGGAGATT TAAGATCTCA AATTTATCTT CACTGACATG TGGCCCCAGA GACTTAAGGA 9900 

ATT&GTTAG GGTGAAATAG AGTACACAAG GTGAGAATTT GGTGATCTTA CCAAATATCA 9960 

ACCTTGGGGT GATCCAAGGA TTTATATTCA TTTTTAGAAC ATCACTATAC ACCTAGAAAT 10020 

AGGTGTGTGT CTGGGATAGG TGTGTGAGGG GACAGAAGTG AGGTTGAAGG TAGGGTGCTT 10080 

GAAGAGAAGA GAGCACAAGG ATTATCAGGA GCTTGGCAAG AGAACTTAAA ATCCTTTTTG 10140 

ACTGTTACTT TCTCGTGGTT CTCAGCCTTC AGTGTACATA AGAATCACCA GAGGAGTTTG 10200 

TTAAAAATAC AGATTCTAGC TCCTTGGTCA GGGATGAATC CCAAGTATTT ATCTGTATTT 10260 

TTACTAATAG ACATCCTATC TTGGTGGATT CCTGAGCTGT AAGCTAACCC CAGAATGCCT 10320 

ATGGGAAGAG CAGCAGGGTA CAGGAAAATA ATTAGGTATT AGGGTACGGG AGGCAGGAAG 10380 

AGAAGTAGAG GATCAGATCT GGTAGAGGGT CAGACTTGGG ACAGTCAGAG AGATCATTGG 10440 

TTTTGGGGAG TGGAGTGTGA AGAAAATGAC AGGGAGAGAT GGGTGCAGGC TTTATGATAG 10500 

GGGATCACAG GAGATAGGGG AGGCCTGGCT GTGAGCTCAA ACTCATCCAC CATGACAGGT 10560 

GATTCCCTGG AGGTGGCGGG GAGCAGACGT GGGACCTGGG AGAAGGGAAC TGGAGAACAT 10620 

CAGAGGCATC AAGCGGGGTG GGATGGGAAG GCAGAAGAAC CAGAATGTGT CAATTGGAAT 10680 

GAGTCGGTTT CCTGCCTGCA AATCCAGATC CTTGCAAGAG CAAAGAGAGG GAGGAGAACT 10740 

AAGGAAATCT ATTGGGGAGG GGGAGAGAAT CACGTGGTGG AGAGAATCTG CAGTGATGAA 10800 

TAGTGTGTGG AAGAGGGAAA CGGTTGCAAG AAAAGGTAGA TAAGAAATCA GGAAACAAAA 10860 

TGGGGGGCAT GCCTTGCCCT GTTGATATGT ATCTTATATG TTCTTGAATG TCCTCATTGK 10920 
TCCTATTAAC CCTGTCTTTA GAGAAGTGGA GGGGCACTGA GGGGCTGTGG GAGAAGCTGG 10980 
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GAGCAGGATC TGGAGTAATA 
AAAGATTTAC AAAGTTGCCC 

5 

GCACTGCCCT TGGCCAGAGT 
GTTCTTCAGA TTCCAACAGC 
10 AAATGTCAGG CCTCAGTGGG 
ACTTATCCTC CCCTCCTGGC 
GTCCCCTCTG CCCTGGCTCC 

15 

AGCAGCTTCT GAAATGTCAT 
TGGCTCTTCC TCCCCCATTT 
20 TGTTTACTTT CCCTAGCTCC 
TGAGAGTTCC ATTCTATTCA 
CCACCCCCCA GTGTCTTTTT 

25 

ATGGATCACT GAGCGGGACC 
AGCTCTGAAA AGAGAAGAGG 
30 GTGAGGAAGG AATTTTCATA 
TTACCGTATT CCTACTGACT 
AAAGGGCTTG GTGACAGAGG 

35 

GGTATGTTCA TTAATACCAA 
TGGAAGACAG AGAAGGAAGT 
40 TGCCAGAAAG GAGAAGAAAA 
AATTTGTTCG AGAAAGATAA 



PCT/SE98/01947 

GATGTGGGGA GAGTGCAGGA AGGTGGGTCC TGAGAATGGT 11040 

TAGTGGGAGG CATAAAGAGA AAACCTTCCA ATGTTGTTGA 11100 

GAGGGTAGGG TGGGCAACAG AGAATTCTCA GTGACTGCTG 11160 

TTCCCCTGGC TCCCCCTTCT CCAACTTCCC ACCGTGTCCC 11220 

AGGTAAGCAG GCTCCAGAGT GCTTTCTTTA TTTCCTTTCT 11280 

AACATTTCAC CCTCCTTAGT CCCCTGAGCC CCCTGTCTGT 11340 

CCACTGGCTG CCATTTCGTC TTCACATGCA TTGGGGTTCC 11400 

ATATCAGTGG GAGGGGAACA GGCAGTGGGA GACCCAAGGC 11460 

CCCCTCCTCC CAAGCTTCCT TTCTTCTCCA GCTTTCTGCT 11520 

AAGCCTCTCT TTAAGGCACC TCTCAAATTG KCTGGTTTCT 11580 

TTCTCTCTGT TCTTTCCTCA TCCTACATTC TTCCCTACTT 11640 

TTCTAATGGA CCTGTCAAAT GTCAGCGCCC AGCAGGAGGG 11700 

CCCTACTGGT CTTGTTCCTG TTCTCTCTTT ACTTATCACT 11760 

GAGGAAACAA ATGGAAGGTG GGGAGAAGGG GTTTGCAGAG 11820 

ATATGGCTTT GAGCAAGCTA TCTGGGGATG TGGAAAGAGT 11880 

TCTTCCACCC ACTGGTGTTT GAAGCATAGA AACATGGGGT 11940 

GAAGGGGGAT GTCTGAGGGT GAGCTGAAAG GAGGTAAGGT 12000 

AGGAGGGGTG TGCAGGAGAG GTGATGGGTA AGGCTCCAGA 12060 

TTAATGAAAG AGGAGAAAAA AGGCACTTGA CAGGAAGAGA 12120 

CGGTAATTAA TGATGAAAGT GAGTAATTGA GAAAGGAACT 121 80-, 

GAGCAGGAAT TGCAGACAGG GGAGGGGCCC CAGGAGAGCT 12240 
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TGCCCTCATC TCCTCTTGTC TTTCAGTGTG ATCCAGGCCA AGCCACCAAG TACCTATATG 12300 

AGCTGCTCTA CAACGACCCT ATCAAGATCA TCCTTATGCC TGGCTGCAGC TCTGTCTCCA 123 60 

CGCTGGTGGC TGAGGCTGCT AGGATGTGGA ACCTCATTGT GGTAAGCAGG GCTATGGGGG 12420 

TCAGAAGATG GGGTCATTCC CTTTTGAGCT CTACTGAAGG GACGATGGCG ATTGTGGGTT 12480 

TGTATTGAAA AGGAGTGTGG AGGACCTGCT ACTAAGATTC AGAGTCCTCT GCAGACCTGA 12540 

GCTAGGCAGC CTCCTAGCAA CAGTGGCCTG ACAGTGCTGC AGCTGACCTC CTTCTTCAGA 12600 

AGGAATTGAA ATTAGATCAG TGAAAGAGCA TCCCGGTTGT GAGGGGTGTG TGGGCCTTTG 12660 

AGAATCTCTT TTCCTTAGGC AGACCAGAGG TGGGGAGGTT TGGAGAGAGT AAGGAAGAGA 12720 

AACCCAAAGG CAGGAAGAGG GTTAAAGGAA CTCTTGGCCA CTCTTGGTGT CCTCAGTGAA 127 80 

CAGACCCTGT TGCACTCACT CTCCCTGCCC CACAGCTTTC CTATGGCTCC AGCTCACCAG 12 840 

CCCTGTCAAA CCGGCAGCGT TTCCCCACTT TCTTCCGAAC GCACCCATCA GCCACACTCC 12900 

ACAACCCTAC CC GCGTG AAA CTCTTTGAAA AGTGGGGCTG GAAGAAGATT GCTACCATCC 12960 

AGCAGACCAC TGAGGTCTTC ACTTCGGTGA GGAGGGGTTG GGCAAGGGGT AAAGGGACAT . 13 020 

AAGCTCAAAT TCCAGCACCA GGAGATGTGA CGTGAGAGTC ACTTTTAGGG GCAAGAACTT 13 080 

GATTCTTCAT TGAAAGAGAA CGCATTCCAT GTGGATTAAG TGCAGTTCTT TCTGTAGCCA 13140 

GGGGAAAGAA TGAGTTGAGT TTTTGGGATC CTCTCTGTCT TTATGATTTT ATGATTTTTT 13200 

TCCCCTGTTT GATGCCCTGT TCCCCAGACA TATAGACCCA GAATGACTCA GTTCTGTTAA 13260 

AGTAGGTTCA ATCCAAAGTG GGGGCAAGAG ATGGGAGCGA AGATGAGATA GGAATCCAGG 13320 

AAGGCAGCAG ATTCCAGAGG CTTTCAAGGG GGGTGGTGGG TGGGTGTGAA TGGGAACAGA 133 80 

GGGGATGGAG CCAGTGGATT ACAGAGGAGA GAGGGAGAGG AAAGAGAGAG AGAGAGAGGA 13440 

ATGAGGGAGA GGAGAGAGAG GGGCAGAAAG GCAGCTGCAT GGATCTGGTA GTTGGTACTA 13500 

AGAGAGAGAA GCCGACAGAC AAGGAGAGGT TGAGGGGGAA GAGGGAGATT TGGGGAGGTA 13560 
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GAGAGGAAAT ACAGGCTCTA CATCTGAAGA AGGCAGTCTG CTCCCTCCCT TTTATTCTAT 13 620 

TCTTTGGGTC TTCTATCCAC TGTGTTCAGT GGCCCTTTAA TCCTCCCCCA CTTTCACTCT 13680 

5 

GATTCAGACC ATTCTTCTCT GATCCTTTGT CTGTCTGCCC ATTTGCCTCT TGAGGTTGAC 13740 

ATCATGCTGT CTGTCCCAGT CCTTGCCTTG TCTTTTCCTG GTTCCTTTAT GTTTCTTTAC 13 800 

10 CCCATCTTTG CCTTCAGTGG TAGGAGTGGG TGAATGGAGT GGCTTCCCCC ACACAGAGCC 13 860 

TCAGCAGGGG CTCACCATTC ACCTTCCCAC TTGGAATCCA CATCCTAAGA CCAGATGCCT 13 920 

TCCCGAACTC CTCACTTCAG GGACAGAAGC TGTTGAAGGA AGGTTCAGAA TGGCTGCTTC 13980 

15 

TTTGCTCTAT CTGAGTATTG CTCTGAAATC CCCAGTTAAC CTCTCTGGTC TTTATTCCCT 14040 

CATGCACCCC GTGTTTTTCC AACTTGTTTT TTATTCCCAC CCAAGACTCT GGACGACCTG 14100 

20 GAGGAACGAG TGAAGGAGGC TGGAATTGAG ATTACTTTCC GCC AGAGTTT CTTCTCAGAT 14160 

CCAGCTGTGC CCGTCAAAAA CCTGAAGGTC AGATGGCTGG GAGTGGTGGG CTCTGTTTAC 14220 

GGAGGGACCA AGCTGGGGGA CAGTGACTGG TTGGAGAGGA AAGCCAGGCG GGGGCAGGTT 14280 

25 

TTGATTCTCT GAGGCAATAG CATCTCCTGG GGAAGTTTAG CTCCATCTTC CAGTTGACGT 14340 

TTATTCACTA TACGTTGAGC GTTACCCTGC ACTAAGCACT TTGGGATGGG AAATCAAAGC 14400 

30 TGTGAAGACA TCTGGCTTAG CCCCTCAGGC ATTCCCGGGC ATCCCTCAGG AGCTGTTTCT 14460 

TTCTCTGTTG TAGCGCCAGG ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC 14520 

CCGGAAAGTT TTTTGTGAGG TGGAGTTGGA TCTGAAGAGG GAGGGGCACT GGGTGGGAGT 14580 

35 

TTCCCTTGGT TTTCTTGTGG GGCCTCCTCT TGGCATCTGT GCCTGAGTTG ATAGCATATG 14640 

ATCTGAGGTG ACGATTCATA GGATGTCTCT GTCTGTTGGC TCTGACTGCA TCCCTTGTCT 14700 

40 GCACACACAT GATACTTTCT TCAGATCTCA TTTTTCTACT GCTTTGTGTT TCCTGAGAAG 14760a 

CCCATGAATT CCATCTGTCC TGACTGGCTG GAAAAGGCCA CTCAGAAATA CAGGGGCTGG 14820 
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GGAGAAACTT AGAAGGAAGA ATTGTCAGCC TTTCCTACTA TCCCCAAGAC TTGTAG ATTT 14OTD 

CTCTTTTTAG TTCTACTGCT CTTCCCTGAT TCCCAAGAGG CTAAATAGTA TCAAGTGAGA 14940 

TAAGACAAAA ACAAACAAAT GAGCAAACAA AAACTCAGCC ATTCTCCTCT GTATTCAGGT 15000 

GTACAAGGAG CGTCTCTTTG GGAAGAAGTA CGTCTGGTTC CTCATTGGGT GGTATGCTGA 15060 

CAATTGGTTC AAGATCTACG ACCCTTCTAT CAACTGCACA GTGGATGAGA TGACTGAGGC 15120 

GGTGGAGGGC CACATCACAA CTGAGATTGT CATGCTGAAT CCTGCCAATA CCCGCAGCAT 15180 

TTCCAACATG GTGAGAGTGT GGGGACTTGC AGTCTGGCAC CTGGGAGGGT GGAGAGGACT 15240 

GAGGGGGCCT TGCAGGGGAA AGGGTGGCAG GGAGAGGGTG CGGAATTTGG ATATAAAGGA 153 00 

GAAGAGGGGG CTGTGCCCAC CCTGAACTTG TCTGCATTAT GTTTCCTGTG GATCCTACCT 15360 

TTGCTCTGAC TTCCTTGGGT AGAGAGAGAA AAAAAAAAAA ACGATGGAGT TGTATGTTCA 15420 

GTAGGTTCCT GATGAGTGGA AGGGCTGTTA CCATGGAGAC GAGGAGCAGT TGGTGAGAAG 15480 

TCAGGAGGAA CCGGCATTAA TGATAATATG GATGCTTGTA TACTCAAGCA CACCTTTACA 15540 

GGAGCACTGT GTCTGGGCAG AATTGCATTT CATTTTCTTG GTGATTTATG TTAGTGTTTT 15600 

AGAGTTGCTT AATATTCACT CATGATTGAT ATGCAATTAG CTTGGATCCT GTTGCTTTTT 15660 

TTAATGATTC TTTTTTTAGA ATTTTATGTG GAGAAGGGGC TTTTGAAATC ATTTAGCCCC 15720 

AGACAGCTGG TTAATGACAG ACTTGGAGCG AGGAGGCGCA TTCTCTGATT TGAGAGGGCT 15780 

GGGGCAGCCC TGCAGCTGAG GGGCAGAAGT GAGTGTGGGT GTGGGCACAG GGGAAGAGGC 15840 

TTCTTTGTTG CTCAGCACTA CATTGTAAAG AGTACAGGAA TTTTGGGTAA GCTCGATACG 15900 

GCCACCTATG TGTCTGCTTA TTACACCAAG TGTGTACCTG CTTTGGAGAG CATGTTAGGG 15960 

GTGAGTTAAT GGTTGTAAAG CCCCAAAATA GCTTGATTTA AAAAGTTTTA CTAACAGTGG 16020 

CATTTGTGTG GTACTTAGAT TCATGATTTC AGGGATTCTA ACAGCAAAAC TCTGATGTGG 16080 

ACTTATCTAC ATTTTGTAGA CCAGGAAAAT GAGACTCAGA GTGGTTTAAT GACTTGTCTC 16140 
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AGGTCATGCA GCTACTCAGC AGTCCAGCTG GAATTGAATC CTAGTTAAAT CTGGCTCCAA 16200 

ACTGCTCTGC TCATCACTTC CCATGGAAAA CCATGAAGCC TGACTAAAAA GTTTGTACTT 16260 

TTATCTGCTA ACACTGTTTA GCTACTGGCA ATTGCTGAGC AAAGGAGAAA TAAGATTTAG 16320 

GAAGTTGACT CTGGTGGTGT GTTGTGATGA AAGATCTGCT GCCACATGGA CTCTGATGAT 16380 

ACACACTGTT TTGGGACATT CTAAATGGAC TCTATGTTTT AAGGTATGTT AGTTGAAAAA 16440 

TACAATGAAA AAATTTTGGA AGATCACTTT CCTGTAGAGT ACCCAAGAGG TGAGTCATCC 16500 

ATAAAACCTG GGAAAGTATT GGAACTGGCT ACCAAAGCCA GAAAGACTTG TTGGTGCCCT 16560 

GAGCCCAGGA CTTGCCAATC GTTGCTTCTG TCTTATAGCA GGGAGCTAGA GGAGGCTGAC 16620 

AGAAAACAGG GATTTGGCTG GATCTAGTGG CTCACACCTG TAGTCCCCAG CACCTTGGGA 16680 

GGCCAAGGCA GAGGATCACT GGAGCCCAGG AGTTCAAGAC CAGCCTGGGC AATATAGTGA 16740 

GACCCCCAGC TCTAAACAAA AAGAGAGAGA GAGAGAACAG AGAGATGGTT AGGGACATAG 16800 

AGGGACACCT GAGCATTGGG GTGGGAAGAC TTGGAAGGTC CCTTTCTGAC TTTGTGTCTA 16860 

^ 16862 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16707 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION : 1 . .2742 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 2743 . .2985 

(ix) FEATURE: 

(A) NAME/ KEY : intron 

(B) LOCATION: 2986 . .3613 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 3 614. .3 677 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 3 678 . .5226 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 5227. .5304 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 53 05 . .6848 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 6849. .6999 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 7000. .7494 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 7495 . .7627 
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(B) LOCATION: 11370. .11591 



(ix) 



FEATURE : 



(A) NAME/ KEY : exon 



5 



<B) LOCATION: 11592 . .11735 



(ix) 



FEATURE : 



(A) NAME/ KEY : intron 



(B) LOCATION: 1173 6. .12512 



10 



(ix) 



FEATURE : 



(A) NAME/ KEY : exon 



(B) LOCATION: 12513 . .13999 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AGCTAATTTT GGAATTTTTT TTTTTGAGAC AGAGTTTCAT TCTTATTGCC CAGGCTTGAG 
20 TGCAATGGCG CGATCTTGGC TCACAGCAAC CTCCGCCTCC CAGGTTCAAG CCATTCTCCT 
GCCTCAGCCT CCGGAGTAGC TGGGATTACA GGCATGCGCC ACCACACCCG GCTAATTTTG 
TATTTTTAGT AGAGACAGGG TTTCTCCATG TTGCTCAGGC TGGTCTCGAA CTCCGGACCT 

25 

CAGGTGATCC ACCCGCCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCGTG AGCCACCGTG 
CCCAGCTAAT TTTTGTATTT TTCATAGAGA CAGGGTTTCA CCATGTTGGC CAGGCTGGTC 
30 TTGAACTCCT GACCTCATGA TCTGCCCGCC TTGACCTCCC AAAGTCCTGA GATTACAGAT 
GTGAGCCTCC GTGCCCAGGC CAAGTCTGGC TAATTTTTAA AAAAATTTTG TAGAGTTGGG 
TCCTCTCTGT TTTGCCCAGT CTTGTCTCAA ACTCCTGGGC TCAAGGAATC CTCCTGCGTT 

35 

GGCCTCTGAA AATGAAAATG TTGGGATTAC AGGCGTGAGC CCCCTGTGCC TGGCTGCCTT 
TTTTTTTTTT TTTTTTTAAA GGCAGAGTCT CACTCCATCG CCCAGGCTGA AGTGCAGTGG 
40 CGTGATCTCT GCTCACTGCA ACCTCTGCCT CTTGAGTTCA AGCGATTCTT CTACCTCAGC 
TTCCCAAGTA GCTGGGATTA CGGGAGCCCA CCAACACACC CAGCTAATTT TTGTATTTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720, 
780 
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AGTAGAGACG GGTTTCACCA GGTTGGCCAG GCTGTCTGGA ATTCCTGACC TCAGGTGATC 8^X) 

CACCCTCCTT GGCCTCCCAA AGAGCTGGGA TTACAAGTGT GAGCCACTGT GCCCAGCCTG 900 

ACTTGTTTTT TATAATGCCT TTTTTTTTTT TTTTTGAGAC GGAGTCTTGC TCTGTCGCCC 960 

AGGCTGGAGT GTAGTGGCGT CATCTCAGCT CACTGTAACC TCCACCTCCT GGGTTGAAGT 1020 

GATtfTTCTCA CCTCAGCCCT CAGCCTCCTG AGTAGTTGGG ACTGCAAGTG CACACCACCA 1080 

TGCCCAGCTA ATTTTTTGTA TTTTAGTAGA GATGGGGTTT CACCATGTTG CCCAGCTGGT 1140 

CTTTAACTCC TGAGCTCAGG CAGTCTGCTT ACCTTGGCCT CCCAAAGTGC TAGGATTAAA 1200 

GGTGTGAGCC ACTGTGCCTG GCCTTTTTTT TTTTTTTTTT TTTTTTGAGC AGTTTTAGTT 1260 

TCCCAGCAGA ATTGAGATGA AGGTACAGAA ACTTCCCATA TGCTTCCCAC ATGCATAGCC 1320 

TTCTACATTA TCGACATCCT CCGCCAGAGT GGTACATTTG TTACAACTGA TGAACCTACA 13 80 

TTGATACATC ATAATCACCC AAAGTCCATA GTTTACATTA GAGTTCACCC TTGGTGTTAT 1440 

ATATTCTATG GGTTTGGACA AATGTATAAT GAGACGTATC TACTATTAAA TACTTTACAG 1500 

AGTATTTTCA CTGGCCTAAT CCAATGGACA TTTATTGTTA CTTCATTATG GTTGGGCACA 1560 

GTGCTAGATG CTGATGATTA AGAGAGGGCA TGGGATTTGG TCTTGTCCTC AAGGGTAGAA 1620 

CCTAGGCCCA TTGCATCTTC AAAGCCCAGG CTCCTTCAAA GCCCAGTGTA GTAGCAACTG 1680 

CTGTACCTTG CCTGTGCCCT TTGCGTATCT CACTCCTCTA TCTCTCTAGA AAGTTGGAGA 1740 

GAAAAGTGAG CAAGGCATGA GGAACAAAGT TATTTATTTA TTCTTCATTC ATCTATTTAT 1800 

TCTTTCATTA CCGTTTGTGT TAAAACATTC CAAACCCAAA CAATTATTTG TATGGTCGCC 1860 

TGTGTATTAC TTGTGGTTTC CCAAGAAGTA GTTGCTAAGC TTTTCCTTGT ATGGTTTCTG 192 0 

TGAGGTAAGG AAGGAATGAT GTGATTTTCT CCAGTATGTA GAATGCAGTT CCAAGAGGTT 1980 

AAGTAATTTA CTTACAGTTA TTTAGCCAAA CAAGGTTACT GCAAGGTATA TGAAGTCAGG 2 040 

TCTCTTGACC CAGTTCATGA GAGAGTTAAA GGAACTATCA TTCTTTTTAG CTTTCATGGA 2100 
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AAAAGAAGGT TGAGTGTTGG GAGGGGTGTG GGTAGGATTG ATAATGGACT TCAAAAATGT 2160 

GAAGGGTATT TCTGTAGTTT TCATTCTTCT GAAAGCCTTC TAAGAGGCAG TGAACCAAAA 2220 

GCACACAAGA ATGGCAAGAA GTTAGCATGC TGAAGAAATA TCCTCCTGGC TGGCAAGCAG 2280 

AGTGAGAAGA CTGCTATCAC CTTTTCTAGA ATCTTTTGGA ATTGTAGGAG CTGTTAGATC 2340 

10 CTGGGTTAAC TCTATGAAGA AAGTCAGAAG GATCAGAGAA CATCAGTGTC ACAGCTCTTC 2400 

ATTGGAATAT CCATGTCTCC TCCTTTACTC TGCTCTACCT TCCATCCTTT GCCACTAATT 2460 

ATCCAGAGTG TTTGTCAAAA TTCTCTGTTT GCAGTTCTGA GCTAGCAACT GTACACACTA 2520 

15 

ACACCATCAG ACACAGCTAA TACCTACTCT AGTCTAGTAG CTTCCGATCT AAGGCAGACA 2580 

CATGGGTATA GTTAAAGATT TTGAATGTAC ATGTGTCCAA TCTGACAACA GTAACACAAA 2640 

20 CCATCCATTC AAGTAGAAGT GATTGAGTCA GAATTGGATT GCACCCCTTC CCCCACACCC 2700 

ACACACATTT CAGTTCTTTC CTCATGATTT TTTCCTCCCA AGACATCCCA GGAATTTGTG 2760 

GAGAAACTAA CCAAGCGACT GAAAAGACAC CCTGAGGAGA CAGGAGGCTT CCAGGAGGCA 2820 

25 

CCGCTGGCCT ATGATGCCAT CTGGGCCTTG GCACTGGCCC TGAACAAGAC ATCTGGAGGA 2880 

GGCGGCCGTT CTGGTGTGCG CCTGGAGGAC TTCAACTACA ACAACCAGAC CATTACCGAC 2940 

30 CAAATCTACC GGGCAATGAA CTCTTCGTCC TTTGAGGGTG TCTCTGTGAG TTAAAACTTC 3000 

CTTCATACTC CCCTGTCTTC CCAATCTTGA GAGAGACTCC CAAGAGGCAC CTTCTACAAA 3060 

CATGCATTCT CTGTTTTTCT CAGTTACTTC TTTGCAGAAT CAGTCTCCGA CCAGAGAAGT 3120 

35 

AGGGACCTTC AAATTAGAAG AACCCATCAA AGACTAGAGG AAAAAAAATG ATGTATTCCA 3180 

TTTTTTTAAA CCCCTCCCCT CATTTCTTTT CAAACTAGAC CAAGTATTCA TGAGTCAGAT 3240 

40 GAGAACTATA GGATTTTGAA AGACAAAACA GTCTGAAAGG TCATCTTCTT ATTCCTTTTA 3300 . 

AAATGAAAAG ATTAGTTTCC AGAGAGATTT GCTGACTTGC TTAGGCCACA CAACCAGAAG 33 60 
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CCTGCTGGTG TTCTGTCTGG GGATTTTTTC CCATTCAAAT CTCATAAGTG AAGCTCCTTC 3^20 

TCCAAAGAAT AATGTTTCTA AAATCTAGGG TATGGGCATC TGGGGTATGT CCTATATGCA 3480 

5 GGCAAATGCC ATAAATAGCA TTCATTCAGA GGCTCAATTA CATCAAAAAC AGAAGGATTT 3540 

AAAGAGTCCC TGATGTTCTC TTTCACTCTT GCTTTTGTCT CCTTTGCCTT GCTCCACATG 3 600 

TTCCTTCCCT CAGGGCCATG TGGTGTTTGA TGCCAGCGGC TCTCGGATGG CATGGACGCT 3 660 

10 

TATCGAGCAG CTTCAGGGTT AGTACAGGGG CAGGAGGGGA CCGGACATGG GGGCTAGGCT 3720 

GGGGCTGGGC TGGGATGCCC CCTGGGGAAG AATGCC AGAG ACATCACAAG ATTGCCCTGG 3780 

15 CACCTCCCAA CTTCTGCCCT TCTCTTTTAA CTCTGTTCAT CAAGCTTGTA AATAATAATA 3 840 

ATAATAAGCT TAACTACAAG AAGATTGATG TCTTTGAGTT GCACTGGTTT TGCTCTTGAA 3900 

AAGAGGTGTG CAGGCTGGGT GTGGTGGCTC ACCCCTGTAA TCCCAGCACT TTTGGGAGGC 3960 

20 

CAAGGCAGGC AGATCATGAT CATGGTCAGG AGTTTGAGAC CAGCCAGACC AACATGGTGA 4020 

AACCTGTCTC T AC C AAAAAT ACAAAAAAAA AAAAAAAAAT TAGCTGGGTG TGGTGGCAGG 4080 

25 TGCCTGTAGT CCCAGCTACT TGGGAGGCTG AGGCAGGAGA ATCACTTGAA CCCAGGAGGC 4140 

AGAGGTTGCA GTGAGCTGAG ATCACGCCAC TGCACTCCAG CCTGGGTGAT AGAGTAAGAC 4200 

TCTGTCTCAA AGAAAAAAGA AAAGAAAAGA GACATGCAAA TTAAAAACAG CTACTCTCTT 4260 

30 

TCCCAGTGGC TTCCATTAAT TTCAGGAATT TCCCCTTGAG TGGCTTGGGT TGAGAGGTTG 4320 

ATGACCTGTC AGTTAGACTC AAGAAAGCTG AATCTAGGAG AACCGCTATT TTTTTTTTAA 43 80 

35 GGGAATCTGC CAAATTTCCT TGCTGTGTAA AGCTTCAATG TGTATAGCTT GGCTTTTG-TA 4440 

GATTGTATTT TCTTGAAACT TAGCACACAG GTATTTGCAG AACTTCTAGG AGTTAATTTT 4500 

TCTGCTCCAC TCGG CTCTCA GTCTTTTACG GCATGGCCAA GAGAGCTATT TCTTGGCCTC 4560 

40 

CTGTGAAAAG TTTCTTTCTT CCTTTCTCCC CACCTCCACA TCCTTTCAGC TCCTCTTTGT 4620 

ATCCAGGACA AGAGGAAATG GACTTCAGCC ATGGTGAAAG GAGTGTGAGT TGGCTTTTGA 4680 
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AGGAAAAGTT ATGGTAACGG AAACAGTTCT AGAACAGAAA TCTTAGAAAT GACCAAATTT 4740 

TACTCAATGG CGCTTTAAGA GGCAGATATA ACTTATCCAA GGAATTAAAA CCCAAGCCAA 4800 

5 

CAGAAGAGAA TGTTCTAAAA TTAAAATGAA AGCCACTGGG AAAATAGAGC CTGCCCATCA 4860 

TGAGAGGAAG AATAAGCAGA AATATGTGTA AAGCTTTAGA AGCCAAAATC AAAGTGAGAG 4920 

10 ACATCTCGCC GAGAGAGGTG TGAGGAATGG AATAGG TGGC AGACATGTTG TGGAGCCTCC 4980 

TCACTGAAGA CTTTTAAACA TAGATATTCT TATTTATTTG AGTTGTCTTG GGAACCACCT 5040 

TATATTGCTT TTAAGTCATG TTGCTGATTC AAGAGTCTCG TAGGTCCTTC CAAGCATCCT 5100 

15 

TAGGGCCTCA GGTGAAAATA AAATCAGATA CAACCATGCA AAGCTCTAGG GAAGTGGGAA 5160 

GTTGAAAATG CCTAGGATCA GCTCTTTGGC TACCTGTGGT CACTCCTTTT ATTGTCGTCT 5220 

20 GCCCAGGTGG CAGCTACAAG AAGATTGGCT ACTATGACAG CACCAAGGAT GATCTTTCCT 5280 

GGTCCAAAAC AGATAAATGG ATTGGTGAGT GGATCTTGTT TGTATTTTCC TTCAGCCCCT 5340 

CTCGACAGTC AAGGGAAAAA GTCATGCCTT TGAGTGAGGA TGGAATGGTA GAGACTGTTA 5400 

25 

GGTTGGAATG TGGC TGGC AG CTGGGCCAGG AGAAAGGGTT AAGTGAGAGT GAATACAACC 5460 

CCTAAGGCGT GGGTAGGGGA GACTGGTGTA TTTGGAGAGG GAATAGGCGG TGGTTAGTAC 5520 

30 TATTTTTAAT GGTGCATTGC TGGGGTAACT GGGGATTAGA GGCAGGGGGT GGGCAGAGGG 5580 

CGGGAAATGG AAACTCCATT TGGGTTTCCC AGATGTCCTG GTGTCTTGAT ATATTTGAAC 5640 

CAGCTACTTC AAGCCCAGAG CTGTCTCTTT GTCTGTCTCT GTCAGGAAAA CGGTTGCTTA 5700 

35 

AACTATGGAG GAGGAGGGAA AACCTCATGT AATTGTCATC TGCCAAAATG TGCTTTTTAT 5760 

TTTTATATGT ATTTTTAAAA ATTTTCCTAT TTTTATGTAA TTTAGAGGTA GACGTGCAGT 5820 

40 TGTGTTACAT GAATATATTG CATAGTGGTG AAGTCCGGGC GTTTAGTGTG CCTGTCACCC 5880 

GAACAGTGCA CCTTGTACCT AATAGGTAGT ATTACATCCC TCAAAATATA CTTTTTAAAG 5940 

HI 865-1 WO SEQ U0 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

AGAGAAAGCA AGCAGTTATT CTTTGTGTAC TTGGTCTAAA TGATAGGACA TAGGAGAGAA 6O00 

ACTGAAGGTG GACAAAAGGA AGGACCTACT GATAAAAGAA AGCCTCCTTG AGAATGAAGG 6060 

GGAGGCTCAA CCATTGAAGA TGGCTGCCGT CTGCCCTGCC CAGCAGATAT CCAGTCATTC 612 0 

CCAGCACTGC TGGAGTTTTG CCCTTTTTTT TTTTTTTTTA CAATTCGAAT TTAGGACAAT 6180 

GTTCTGGATT GCTATAAATG CTGCATGGCC TAAATTATTC TTTAAAAAAA AAACTAAGCA 6240 

AATTGAAATT AGTTTTTTTT GGTGAAC TCT GACAAATTTG AACTTCCCCC TAATAATAAC 63 00 

TGGAAAACAT ATTTGGGAAT ATTACCCTGC CAGGATTAAA ATTTCAGATT AGCTTTCCTT 63 60 

CTTTTTTTGT TTGTCTTAAG AATAGGTGTC CACACTAGAT ACTTCAAGGC CTTTTTAGCT 6420 

TTATGATTCC ATAATTGTCA TTTAAAACTT TGATTTGGGT TATAAGAAAC CTTATAACAT 6480 

TTTTTAATGA TCCCCTTCTT TCTCCTCCCA TTTTCCTTTG CTGTAAGAAA GACAGAAAAA 6540 

CTTAAAGAAC AAACAAAAAC AAAGACTACA ACTTTGGGGA CATGCCTCAG CATTTCCCAA 6600 

CCTATGGATA GACCATTCAC TCCATCTTCT CATCTCATTT CTGGTTGCTT CCTAACGGCC 6660 

CCAGTGGCAC TGAGCATTCT GCCTGCAGTA ACCTCTGTCC AGTGCAGTTA GGGCCTCATG 6720 

TCCCCAGCCA ATGACTGAAT GTCCATCAGC AATCTAGTTC TTTGCCCTTT TCTCCTATCC 67 80 

CGTCTTCATT CCTTTGTCCT CCTTCCCTTC TCTTTTCCCT TCCCCTCTTC CTCCCCTGTG 6840 

CCATGCAGGA GGGTCCCCCC CAGCTGACCA GACCCTGGTC ATCAAGACAT TCCGCTTCCT 6900 

GTCACAGAAA CTCTTTATCT CCGTCTCAGT TCTCTCCAGC CTGGGCATTG TCCTAGCTGT 6960 

TGTCTGTCTG TCCTTTAACA TCTACAACTC ACATGTCCGG TAAGTTTCTC TTCTGAGGTT 7020 

TTCCTTGTCT GCCTCTCTGA GATACTGATC ATGTTCCTGG ACAGGATGAG AATAAAACCT 7080 

GTGTAACTCC CATGGCCATG TATCATGGAG TTTTTCATTC TGACTTGTTG AGAATGAAAA 7140 

CAGGGAAACC AG AT AT AAC C CCCACTCCTA CTCCAAAGTA GCTAACGGGA GGAAAAAAGA 7200 

AAAGAAGAGA AAAAAACAAC CTTTGGGGCC AGGTCTCACA GTCTTGGACT CTACATAAAT 7260 
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AGCCTGTATT CTAGTGGGGG CCTGTGCTTG GGAAGCCCTC TGCAACTCCA TCTTCAGCCC 7320 

CATGACTGCA TTGCTCTGCC TCTCAAGGCT CCACTGTCTT CTCCAATCCT GTCTTCCTTT 7380 

AGCCCCTGGC CCTGAAATTA GGGTCATGCC ATTGCGTGGT ATTTGGAGAG CTCAGCCTCC 7440 

CTGGAGAAGA GGGGTAATTC TCTCTCCCTC TCACCCTCTC CACCTCTGCC CTAGTTATAT 7500 

C C AGAACTC A CAGCCCAACC TGAACAACCT GACTGCTGTG GGCTGCTCAC TGGCTTTAGC 7560 

TGCTGTCTTC CCCCTGGGGC TCGATGGTTA CCACATTGGG AGGAACCAGT TTCCTTTCGT 7620 

CTGCCAGGTG AGGAGGTGGT GGGCAAATTC CTTACAGGAT GTGAC TCTCC CACCCGTCTC 7680 

AGGAGCACCT TCCATGATTT ATGATTCTCT GCCCTTCCTC CTCAGCTTTC CCTGACTCTT 7740 

GTCCCTGTTC TTTCCTTCTA GCATCACCCC TCTGTTCTCT GTTTGGCTCT GTCCCTTCTT 7800 

TCTGTGTCTG CAGGCCATTT TCATTCTGTA GTTTACTTGT CAGTTCCAAG GTTGCCATGG 7860 

CAGSCCTYGC AGAGAAGAGG AGGGAGCCAT TGAAGGCAAA GGAAGGGGAT CTGCTCAAAG 7920 

GTCTCCTGAA CAATGGTGGC TTGTCTGTGG TATGGGGGCT GAGAATCAGA ACTGTGGACT 7980 

TTTTTTGGGA GCCTTTGTTG GGTTTGGAAG GATAGAAGCA GAGATGGAAA CACAGCAGAG 8040 

AGTTGGGGGG AAGGGACCAC TGCCACACAG GGGAGGAGGG GCTCTGGGAC TGTTGGTACA 8100 

TGGAAGGTTC TAGTGCTGTG GGGAGAGGCC AGCTTCAACA GTGATAGTTG AGTGGTTCTC 8160 

TTTTCCACTG GTGGAAACAC CCACTCTTTC TCCTGATCTG CCTGCCTGTC CTTGCTCTCT 8220 

CTTTTTCCTC TGCTCTGTGC TGTCCTGATC ATACATCTGT GCACATGGCA TTTCCATGCA 8280 

CATGCACATG CAGTTCATCA GGAATCCTCT GTTCCCAGTG AGGCCAGAGT GCAGCTGGAG 8340 

AAGCAGACAA TTAGCTGTAG TGCAATAGGA GAGGTTCCAG AGTAGGGATC TGCACAAAGT 8400 

GCTTTGGGGG CAAAGAAGGG AACACAGTTC ACTGCTGGCG TGATTGGGTG GACCTCACTG 8460.. 

AAGAGGTGGC ATTTGAATAC TGAAGGACAA ATAGGATTTT ATCAGCTAGA GAAATAGAGG 8520 
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AAGGCTACTT CAGGGGCATA GGGAGCATCG TGTGGCTAGA AAATACATGA AAGAGAGTAG BSBO 

ATGAAGAGAA AGTGAGTAGT TCAGCATGGC TGGAG CGTGG GGTAGGTGTG GGGCTGGGAG 8640 

ATGAGCCTAG CTGGACAGGT GGATGGGAGC ATGTTGTGAA GGGTCTGTGT CATATCCAGA 8700 

AGTGTTCAGG CTATAACTTA TAGATATTGG GGAGTGGTTG GAGGTTTTTG GCCACTAAAG 8760 

CCAGGAGGTT TTAGCAAGAT CACCCTGGTG GTGTGGAAGT AGAGGGTGGA TGGGAGGAAT 8820 

TGTTCAAGGT GGGGAGACTG CTCTCCTCCT GCCGCTCCCC GTCCTGCTCA CATTTTCGCA 8880 

TCCTCCCTGT GCCACCATGA GCTCCCTGCC CGTGCTCCCT GCCCACTCTC CCTTAGGGTT 8940 

CTGCCCATCC TTACTGCAGT CCCGGCTACT ACTCTACCCT GTTCTGCCTG TGCCCTCTCT 9000 

TCCTTTCTAG GCCCGCCTCT GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT ACGGTTCCAT 9060 

GTTCACCAAG ATTTGGTGGG TCCACACGGT CTTCACAAAG AAGGAAGAAA AGAAGGAGTG 9120 

GAGGAAGGTG AGCTGCTGCC CAATCCTCAG CCCCCARATC CTTGGCTCCT GGGGCACAGA 9180 

GCATTTTCCC CTGACGTGCC TGTTCTCCCC ACATATTTAT CCAGACTCTG GAACCCTGGA 9240 

AGCTGTATGC CACAGTGGGC CTGCTGGTGG GCATGGATGT CCTCACTCTC GCCATCTGGC 9300 

AGATCGTGGA CCCTCTGCAC CGGACCATTG AGGTACCACT GGAGAGGAGG TGCTATGGTC 9360 

AGGAGAATGA GCAGGGCTCA GTGGCCATCA GGGCCCTGGG GCTGTGTGTG TCTTGAGGGA 9420 

TGAAGCTACT TGGAGAGAGT GCCTTCCTCG TATTGGAAGC TCTTCCTTTC CTTCCTAGAA 9480 

GGAGCCCCTC ATAGGCCTCC AGATTCAGCT GAAGAAAGGA AGGGGTGGGA ATCTGGGAAG 9540 

GGTGTGTAGA AC TTCC AGGC ATCAGGGAAA GTGGGGAACA AGCACCTCCA AGGGTTCAGG 9600 

AAAACATTCT TAGGCCTAGA ATGAGATTTG GCATCAGCAT TGAGGGTCTC ATAGGAAAAC 9660 

AGTTGGAAGC CAGAGACTGA GAAGCGTTGA GGAGAGGAGG GGAGGCTGGC AACCATCTTT 9720 

CTTGTGACCT TGTTTCTGCC CTAGACATTT GCCAAGGAGG AACCTAAGGA AGATATTGAC 9780 

GTCTCTATTC TGCCCCAGCT GGAGCATTGC AGCTCCAGGA AGATGAATAC ATGGCTTGGT 9840 
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GTGTGGGATG TGGGCAAAGG AGGGCAGGGA TGCACAAAGG CAGGAGGGAA GGCAGGGGTA 9900 

GAGGGCTTGG AGGGAGAGGG GTCTTTGGAA GAGGAGGTAG AGAGCTTGTC AACCCAGTTT 9960 

GAACACCCTA CTCTTTGTTA TTGCACTAAT CTTTTCTGAG AATAGGGGAG AGTTGCTCTT 10020 

TTGCTATGAG GAGCTTAGGG CCCAAAGCAC AGAAAGCACA GATGAAGAAC TTGTGTTCAG 10080 

10 CAGAGGAACA AGTGGGGGTA ACCCCACCTC CAGACTTGAC ATTATCTTTT AGATCCCCCT ' 10140 

TGGCCTTATT AGCATTGTTC GATTCATGGT CACAAATTGC AAACCTACCC TCTGCCTGGA 10200 

AAGCCACCTT CCCACCTGTA GGGTAAGGGT GAGGCATGTG TGGCCCAGAC TGGCCTATTT 10260 

15 

CTAGATATTC AACAAGCCCT TGCCTGACTG ACAGCAGCTT GCCACCATTG CTTTCCTGTG 10320 

TGAATCCCAG GAAAAAGTGA TGTGGTCTGG GCAAGTTGGG TGGACATAAG GGATAGGGGA 10380 

20 CACAGGGTGA GGTTTGCTAG GTCAGAGGGG TTGGATTGGA GAGGAGGGCC CCCTTTCCAT 10440 

TTCAGAGTAG GTGAAGGGCA GAGAGGGGAT GGGGATTGAG TGAGGAGCAT TGTGGTCCTT 10500 

GTTGCTCAAG TGACTCTCTC CTGCCATCCT AGGCATTTTC TATGGTTAAC AAGGGGCTGC 10560 

25 

TGCTGCTGCT GGG AATCTTC CTTGCTTATG AGACCAAGAG TGTGTCCACT GAGAAGATCA 10620 

ATGATCACCG GGCTGTGGGC ATGGCTATCT ACAATGTGGC AGTGAGCACT GACCCCATGG 10680 

30 CATTGACCCT GTAGGCTGAC CACAGCAGCC CAGATATAGA GGACTAGGAA GAATCAATGC 10740 

TAGATCTGGG ATCGGTTGCT TAGAAGTCTT AAAAAGTTTG TTAATTCTTC AGGTCTATAA 10800 

AGCACTTTAC AGTTTACAAA GCTCACTACA GACATTGTAT CATTAATCTT GCAACTACCC 10860 

35 

AGTGAAGTAG ATATTAGTAT CCCCACTTTA TAGGTGAGGA AACAGAAACA CAGAGACGTT 10920 

AAATTGCTTG TCTGTGGTTA ATGGGCTGGA CTCTATTGAC ATTTCCTGCC AGGGACCGAC 10980 

40 TCTGGAGGAC CCGGAATCTG TGCATAGAGA TCCTGGGAGT TCCTGCCTTG AGGGGAGGGG 1104 0 it 

TTAACCAAGA GTGAAAACTG GTTTGGGACA GTTTGAGATT TTTCTCAATC TATATTGAGG 11100 
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ATGATCCTGA ATTTGGATCC TTTTCAAAGG GAAAGTTCAC CAGGAAACTG TCTGC AT AGA 1X1S0 

CTCCCTCCCA TGGGAAGTAA ACTCTGGATC TTGTCTGAGC CTGCAGACCT GAGACTCCCT 11220 

5 CAATGTGTCT TTCCCTCTAG GTCCTGTGCC TCATCACTGC TCCTGTCACC ATGATTCTGT 11280 

CCAGCCAGCA GGATGCAGCC TTTGCCTTTG CCTCTCTTGC CATAGTTTTC TCCTCCTATA 11340 

TCaStCTTGT TGTGCTCTTT GTGCCCAAGG TAAGGATCTG GCTTTTCTCC CACCCTCTTT 11400 

10 

GTTCCCATGT TCCCTCCATC CCTCCTTCCT ATATTACTGA GTTCC TCTGC CCTTCCGTTC 11460 

ACCCTCCTCT CACTCCTCCC CTTGTTTTGG GCCCAACTCT TATCAGCATT CCTTCCACCT 11520 

15 CCAACCTTCC ATCAGCCAGT CACTAGTACA GTCCTTGCTG GGCCACCCCA CGCCCAAACA 11580 

TTTGCCCCCA GATGCGCAGG CTGATCACCC GAGGGGAATG GCAGTCGGAG GCGCAGGACA 11640 

CCATGAAGAC AGGGTCATCG ACCAACAACA ACGAGGAGGA GAAGTCCCGG CTGTTGGAGA 11700 

20 

AGGAGAACCG TGAACTGGAA AAGATCATTG CTGAGGTGCG GGGGTGGGTG TCAGGGTAGG 11760 

GTGTTGGAGT GGTCCAGGAG GCTTGCGTCT TAGCTTGGGT TGTCTGAAGC CAAGCCTGAG . 11820 

25 ATACAGGGTC AGATGTTCTT GGCTCATGGA GGGAGGGTCC TAGGAGACAA CCTGTAAGGA 11880 

GTGAATGGAG CAGCATAGGG GAGGGGAAAG GGCTGAGCAA GATTCTATCT CAGGCAAAAT 11940 

CCAGTGTTGG CCTGGCAGGT GGAAGGGCTC TGGAGTGGGA GCTATGTGGT TGACTCAGCC 12000 

30 

TCCTTAAGGC AAGAGGATGG CTGTTGGCTG TAGGTGACAA CTGGAGAGAG GCAGCTGTGA 12060 
GCCTCTAGTA GTCAACACTC ACAGCAGCTG GGTGTAGCAT GCAGCCCCAG CATAAAGGAC 12120 
35 CTGGGCAGGC GTTCACTGTG CCCCAGGCTG TCATTAGGGG CTGGTGCAAT GCCAAAGAGA 12180 

GGGATGTTCC AACTGGGTTG ACACATCTCT CTGATTTATT GGAAGCTCTG TGCACTGACT * 12240 

TTTCTCTCCT TCCCCACTTT TTCCTTTTGT TTTTAAATTC TCTCTTATTT CCCTGATCGC 12300 

40 1 
ATTTTTTCTA TCGGTATCCT TATGTTCTCT GGCTTTTCTT GTTCTGTTTT GATTTCTCCT 123 60 

TTTAATTTAT TCTGTCCACT TACCCTACGT CCTCCCCCTA CATTTTTCTG TGCCCTTCCT 12420 
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CTCTTTCCCT GTGCCCTTCC TCTCTTTCCC TCCTCCCCAC TCCTTCATCA CCTCCTCTTC 1248a 

TCCTACTATC CCAATTGTGC TTCTTCCTCC AGAAAGAGGA GCGTGTCTCT GAACTGCGCC 12540 

5 

ATCAACTCCA GTCTCGGCAG CAGCTCCGCT CCCGGCGCCA CCCACCGACA CCCCCAGAAC 12600 

CCTCTGGGGG CCTGCCCAGG GGACCCCCTG AGCCCCCCGA CCGGCTTAGC TGTGATGGGA 12660 

10 GTCGAGTGCA TTTGCTTTAT AAGTGAGGGT AGGGTGAGGG AGGACAGGCC AGTAGGGGGA 12720 

GGGAAAGGGA GAGGGGAAGG GCAGGGGACT CAGGAAGCAG GGGGTCCCCA TCCCCAGCTG 12780 

GGAAGAACAT GCTATCCAAT CTCATCTCTT GTAAATACAT GTCCCCCTGT GAGTTCTGGG 12840 

15 

CTGATTTGGG TCTCTCATAC CTCTGGGAAA CAGACCTTTT TCTCTCTTAC TGCTTCATGT 12900 

AATTTTGTAT CACCTCTTCA CAATTTAGTT CGTACCTGGC TTGAAGCTGC TCACTGCTCA 12960 

20 CACGCTGCCT CCTCAGCAGC CTCACTGCAT CTTTCTCTTC CCATGCAACA CCCTCTTCTA 13 020 

GTTACCACGG CAACCCCTGC AGCTCCTCTG CCTTTGTGCT CTGTTCCTGT CCAGCAGGGG 13080 

TCTCCCAACA AGTGCTCTTT CCACCCCAAA GGGGCCTCTC CTTTTCTCCA CTGTCATAAT 13140 

25 

CTCTTTCCAT CTTACTTGCC CTTCTATACT TTCTCACATG TGGCTCCCCC TGAATTTTGC 13200 

TTCCTTTGGG AGCTCATTCT TTTCGCCAAG GCTCACATGC TCCTTGCCTC TGCTCTGTGC 132 60 

30 ACTCACGCTC AGCACACATG CATCCTCCCC TCTCCTGCGT GTGCCCACTG AACATGCTCA 13320 

TGTGTACACA CGCTTTTCCC GTATGCTTTC TTCATGTTCA GTCACATGTG CTCTCGGGTG 133 80 

CCCTGCATTC ACAGCTACGT GTGCCCCTCT CATGGTCATG GGTCTGCCCT TGAGCGTGTT 13 440 

35 

TGGGTAGGCA TGTGCAATTT GTCTAGCATG CTGAGTCATG TCTTTCCTAT TTGCACACGT 13500 

CCATGTTTAT CCATGTACTT TCCCTGTGTA CCCTCCATGT ACCTTGTGTA CTTTCTTCCC 13560 

40 TTAAATCATG GTATTCTTCT GACAGAGCCA TATGTACCCT ACCCTGCACA TTGTTATGCA 13 620. 

CTTTTCCCCA ATTCATGTTT GGTGGGGCCA TCCACACCCT CTCCTTGTCA CAGAATCTCC 13 680 
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ATTTCTGCTC AGATTCCCCC CATCTCCATT GCATTCATGT ACTACCCTCA GTCTACACTC 13740 

ACAATCATCT TCTCCCAAGA CTGCTCCCTT TTGTTTTGTG TTTTTTTGAG GGGAATTAAG 13800 

GAAAAATAAG TGGGGGCAGG TTTGGAGAGC TGCTTCCAGT GGATAGTTGA TGAGAATCCT 13860 

GACCAAAGGA AGGCACCCTT GACTGTTGGG ATAGACAGAT GGACCTATGG GGTGGGAGGT 13920 
GGTGTCCCTT TCACACTGTG GTGTCTCTTG GGGAAGGATC TCCCCGAATC TCAATAAACC ' 13980 

AGTGAACAGT GTGACTCGGC ACCTTGCAGT CTTCCTGTGA ACAGAATGGG CTTCAATCCA 14040 

AGAAGGGAGG CTCAGAGGAC TCCAAGTTCA TGAAAAGGCA TTAAAGCGGA GGGTGAAAAG 14100 

AGGTGTTTTA TTGATCCATT GAGGGCTTAG CAGAATGAAG CAGGACATGA TTAAGTCTGA 14160 

GATTAGTGAG TGAGGACACT ACTGGTTAAA AGTGTGGGCT CTGGAGTCAG ACTGCCAGGG 14220 

TATCAGATCC AACCACATGC AAACATTTTC TTAGTCTCTA TTCCCCATGT CCTCATTTAT 14280 

GAAAATGAGA ATAACAGTAA TACATTCCTC CATAGGTTGG GTACAAAGAC TATTATAAAT 14340 

TGTGCATTCA GGTGCCTAGG TTGGCCCTTG GGCCATGGTA TATGTTGCGT GAATGTTAGC 14400 

CTCTGTCCCT GCTGTTTAAT GAGTTCCTTG ACAGTAGTGG GCATGTATTG GGAGCCTGGA 14460 

GCAAGTGCCT AAGCATCCCC TCTAGGGACG CTCCTTCCCA GGAACTAAGA AGAGTAAAAG 14520 

AATGATGACT GCTAGAAGGT AATGGATGAG ATGGCTGCTG AGTGCTTCCA ACCTTAAACA 14580 

TCTTTGTTTA GGAACTCTGA GCATCTTGGA AATAATTTGC TATCAAACTG AAAAAAAATC 14640 

TTGAATGGAC AAGGGCAAAA ACATTTGCCT GAGACTTTAA ACATTTTTTG TGTCATCTTG 14700 

GAGAGTGCTT TTTTGAAGCT CAAATTTTCT TTTATTTTGG CACTGATTTT TAAAGTGifTT 14760 

CTCAGATTTT TGTAAGACAG CTGCAAGGGT TGGATGGGCC CTGTCATTCA CTGACCTGTT 14820 
AAGAGCCAAT TTCTGAACTT CCACTAAAGC ATGGGCTGGT TGAATCTTAG ACCAGTACTT 14880 
GAAAAACTTT CCACTGTGAT TATCCACCTT GGACCAGTTG G AC TTAATTT GAGTCTCTCT 14940 
TCTTCCCACA GTGAAATATC CGCTAGGAAA AGAGAAGAAG TCTATGGAGT AGATAAGCCT 15000 
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GCAGTTTGGA AAATGAATAG TTGGCATCAG AACAGCAGCA GGAGGCTGGG TGTGGTGGCT 15060 

CACGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGTGGGT GGATCACGAG GTCAGGAGTT 15120 

CAAGACCAGC CTGGCCAAGA TGGCGAAACC CTGTCTCTAC TAAAAATACA AAAATTAGCT 15180 

GGGCGTGATG GTGGGTGCCT GTAATCTCAG CTACTCGGGA GGCTGAGGCA GATAATTGCT 15240 

10 TGAACCCAGG AGATGAAGGT TGCAGTGAGT TTGTGCCACT GCACTCCAGC CTGGGCGACA 15300 

GAGTGAGACT GTCTCAAAAA AAAAAAAAAA GAAAAAAAAA AAAGAAACAA CAGTAGCAGG 15360 

AGCTATAGAA C AG CCCTGGG TAGAACCTAA AAGACCCAAA TTATCATCTC AAACTTGCAT 15420 

15 

TGCACTTAAG TGGGCTGTAA ATTATAAACA AAGGGTGAAA AGTTCTACTG TGGCCAAAGG 15480 

TAAGCCAGAC ACTCTGCTAG CAGGAGTGCA GGAGTCGAGA GCCAAACGGT GCGGCTAGCA 15540 

20 GAGTGCCCAG TGCAAAGGGG TGGGAAGGAG TGAGATTGAG AATATTAAAA AGGTACTTAG 15600 

AAGAGAACTT GTAAGATTTT TACTGGCCAA ATTTAAAACA TGACTGAGCA CTATTTTTCA 15660 

TACAGGCCTC CTACTAATAA GAAAACAATT TTGAGATAAC TACTTATTTG AGTTCACAGT 15720 

25 

TAATGTTCCT GATGATTAAG ATCAGTTGCA AATGTTCATC TGTCAATGCT TATCTACAAT 15780 

GAGACTTCAT GTATTCATTT CTGAAAGTGT CTTTTCAGGG TGAATGGTGC TATTGATTAG 15840 

30 CACTAATACT AATTATTAGT ACATTATATA TAATTACAAT GAGATATACA CACACATACA 15900 

CACACACATA TATACATATA CACACACACA ATGATATATA ATTAATATAT AATTGTACCC 15960 

CAAGGGGTGC AAAGGAGATG TGTTGCCAGG TGGAGAGGCT ACCGCTTGGC AGTTCTGGGA 16020 

35 

GGACTTGCTC CCTGTGCACT GTGAGGCAGG CTTTGCCTTT CAGACCTGCC TTTGGGTAGG 16080 

GTTCAGATCA CTTTCTAACT CTGGAATGTC CTAGAATGTA GACTGCCTGC AGGCTTCCAT 16140 

40 GTCCCCTGCT TTTCCCTGAC TTAGCCTGTT GCCTCCCTGC CCTCCTGTTG GTTGTCTACC 16200^ 

AGTAGAGAGC ACTTTGTGTG CACTTGGCTG CTACATTAGT TAGGTGATCT TCAACAAGTG 16260 
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TTGATGGTGA GTTGCTGTGG CAGGTGCTTT TTTTGGCACT GAGGCAAAAT GGTGAGTAAG 16320 

ATGGCTTTCA AGCGTTGTAC CTTCTCGACG TGGGAAGATG ACCAGTAAGC AGAAAAACAA 16380 

ACGAGATCAC TTAAGAGAGC AACCAGGAGT GTTGGGCATC TCACAGCCAT TAGCTCTGGT 16440 

GTGAAGGACA AATCTAAAAG CAAGGGGACT GTGTGTTCAT TTTCTGGGGT CACAAAACTA 16500 

AGGAGCAAAG CCAGTATTCA AACTGCATAT ATTATTATCT ATTGCCACAA AACCTGTTAC . 16560 

CCTAAATGGC TTCAGACAAC AATAATCATT TATTATCCAT CATGGTTTTT GTGGGCTAGA 16620 

AATTCAGATA GGGTACAGTG GGGAGGGTTC ACCCTGCTCC ACAGCATCTG GAAGATCTGA 16680 

AGTCTGAGGG TTGTTGTCCA AAAGATC 16707 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 
(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GGTCATCCAG CGTTGAGGTG AAGAC 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GAAGGTTGCC AGATTATACA TCCGC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 65: 
CCACGATGAT TCGAGCATCT TGACG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 66: 
CTGGTTCCTC CCAATGTG 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CCAGTGGACT ATGAGATTGA G 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CTGGTTCCTC CCAATGTG 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single — 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCAGTGGACT ATGAGATTGA G 21 

(2) INFORMATION FOR SEQ ID NO : 70: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2700 base pairs 
15 <B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : s ing 1 e 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 

( i i i ) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 
30 <B) LOCATION: 1 . .2700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 70: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
35 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly * 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
40 20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
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35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
5 50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 

65^ 70 75 80 

10 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

15 GTG AAT CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA 33 6 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala 
100 105 110 

CTG TTT CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC 3 84 

20 Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro 
115 120 125 

GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG 432 
Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu 
25 130 135 140 

CCG GAC TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA 480 
Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp Pro 
145 150 155 160 



30 



35 



40 



GGC CAA GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC 52 8 

Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro He 
165 170 175 

AAG ATC ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT^ 57 6 

Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala 
180 185 190 

GAG GCT GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC 624 
Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser v 
195 200 205 

TCA CCA GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG 672 
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Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr ~~ 
210 215 220 

CAC CCA TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA 720 
5 His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu 
225 230 235 240 

AAG TGG GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC 768 
Lys Trp Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val 
10 245 - 250 255 

TTC ACT TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA 816 

Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly 
260 265 270 

15 

ATT GAG ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC 864 

lie Glu lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro 
275 280 285 

20 GTC AAA AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC 912 
Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie He Val Gly Leu Phe 
290 295 300 



25 



TAT GAG ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT 960 
Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg 
305 310 315 320 



CTC TTT GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC 1008 

Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp 
30 3 2 5 3 3 0 3 3 5 

AAT TGG TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC ACA GTG GAT GAG 1056 

Asn Trp Phe Lys lie Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu 
340 345 350 

35 

ATG ACT GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG 1104 

Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu 

355 360 365 

40 AAT CCT GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT 1152. 

Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe 
370 375 380 
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GTG GAG AAA CTA ACC AAG CGA CTG AAA 
Val Glu Lys Leu Thr Lys Arg Leu Lys 
385 390 

GGC TTC CAG GAG GCA CCG CTG GCC TAT 
Gly Phe Gin Glu Ala Pro Leu Ala Tyr 
405 

CTG GCC CTG AAC AAG ACA TCT GGA GGA 
Leu Ala Leu Asn Lys Thr Ser Gly Gly 
420 425 



AGA CAC CCT GAG GAG ACA GGA 12TT0 
Arg His Pro Glu Glu Thr Gly 
395 400 

GAT GCC ATC TGG GCC TTG GCA 1248 
Asp Ala lie Trp Ala Leu Ala 
410 415 

GGC GGC CGT TCT GGT GTG CGC 1296 
Gly Gly Arg Ser Gly Val Arg 
430 



CTG GAG GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC 13 44 

Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr 
435 440 445 

CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG 1392 
Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
450 455 460 

TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT 1440 
Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 
465 470 475 480 

CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT 1488 
Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp 
485 490 495 

GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA 153 6 

Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro 
500 505 510 



GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA 1584 
Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys 
515 520 525 

CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT 163 2 

Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
530 535 540 

GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC 168 0 

Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie 
545 550 555 560 

HI 865-1 WO SEQ 125 

SUBSTITUTE SHEET (RULE 26) 



10 



WO 99/21890 PCT/SE98/01947 



CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA 1728 
Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
565 570 575 

CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT TAC CAC ATT 1776 
Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
580 585 590 

GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC TGG CTC CTG 1824 
Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
595 600 605 

GGC CTG GGC TTT AGT CTG GGC TAC GGT TCC ATG TTC ACC AAG ATT TGG 1872 
15 Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
610 615 620 

TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG GAG TGG AGG 1920 
Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
20 625 630 635 640 

AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC CTG CTG GTG 1968 

Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 

645 650 655 

25 

GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG GAC CCT CTG 2016 

Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu 
660 665 670 

30 CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG GAA GAT ATT 2064 
His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie 
675 680 685 

GAC GTC TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC AGG AAG ATG 2112 
35 Asp Val Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met> 
690 695 700 

AAT ACA TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG 2160 
Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
40 705 710 715 720 

CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC ACT *GAG AAG 2208 
Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
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725 730 735 

ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT GTG GCA GTC 2256 
lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val 
740 745 750 

CTG TGC CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC AGC CAG CAG 23 04 

Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin 
- 755 760 765 

GAT GCA GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC TCC TCC TAT 23 52 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr 
770 775 780 

ATC ACT CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC 2400 
He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr 
785 790 795 800 

CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA 2448 
Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
805 810 815 

TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG 2496 
Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
820 825 830 

AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT 2544 
Asn Arg Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser 
835 840 845 

GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC 2592 
Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
' 850 855 860 

CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC 2640 
His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
865 870 875 880 

CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG 2688 
Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 
885 890 895 



CTT TAT AAG TGA 



2700 
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Leu Tyr Lys * — 



5 (2) INFORMATION FOR SEQ ID NO: 71: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 899 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY.: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 71: 

15 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

25 Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

30 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala 
35 1 00 1 05 no 

Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro 
115 120 125 

40 Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu 
130 135 140 

Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro 
HI 865-1 WO SEQ 128 " 
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145 



150 



155 



PCT/SE98/01947 

160 



Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie 
165 170 175 

5 

Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala 
180 185 190 

Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser 
10 195 200 205 

Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr 
210 215 220 

15 His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu 
225 230 235 240 

Lys Trp Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val 
245. 250 255 

20 

Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly 
260 265 270 

lie Glu lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro 
25 275 280 285 

Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe 
290 295 300 

30 Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg 
305 310 315 320 

Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp 
325 330 335 

35 

Asn Trp Phe Lys lie Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu 
340 345 350 

Met Thr Glu Ala Val Glu Gly His lie Thr Thr Glu He Val Met Leu 
40 355 360 365 

Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe 
370 375 380 

HI 865-1 WO SEQ 129 
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Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly 
385 390 395 400 

5 Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala 

405 410 415 

Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg 
420 425 430 

10 

Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr 
435 440 445 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
15 450 455 460 

Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 
465 470 475 480 

20 Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp 

485 490 495 

Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro 
500 505 510 

25 

Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys 
515 520 525 

Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
30 530 535 540 

Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie 
545 550 555 560 

35 Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser- 

565 570 575 

Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
580 585 590 

40 

Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
595 600 605 
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Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
610 615 620 

Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
5 625 630 635 640 

Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
645 650 655 

10 Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin He Val Asp Pro Leu 
660 665 670 

His Arg Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He 
675 680 685 

15 

Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
690 695 700 

Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
20 705 710 715 720 

Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
725 730 735 

25 He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala Val 
740 745 750 

Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin 
755 760 765 

30 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr 
770 775 780 

He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr 
35 785 790 795 800, 

Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
805 810 815 

40 Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
820 825 830 



Asn Arg 



Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser 
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83 5 

Glu Leu Arg His Gin 
850 

His Pro Pro Thr Pro 
865 

Pro Glu Pro Pro Asp 
885 

Leu Tyr Lys 



840 

Leu Gin Ser Arg Gin Gin 
855 

Pro Glu Pro Ser Gly Gly 
870 875 

Arg Leu Ser Cys Asp Gly 
890 



845 

Leu Arg Ser Arg Arg 
860 

Leu Pro Arg Gly Pro 
880 

Ser Arg Val His Leu 
895 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 1 . .294 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 

15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 



H1865-1 WO SEQ 
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20 



25 



30 
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ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTG TGA TCCAGGCCAA GCCACCAAGT ACCTATATGA GCTGCTCTAC AACGACCCTA 3 44 
Val * 

TCAAGATCAT CCTTATGCCT GGCTGCAGCT CTGTCTCCAC GCTGGTGGCT GAGGCTGCTA 404 

GGATGTGGAA CCTCATTGTG CTTTCCTATG GCTCCAGCTC ACCAGCCCTG TCAAACCGGC 464 

AGCGTTTCCC CACTTTCTTC CGAACGCACC CATCAGCCAC ACTCCACAAC CCTACCCGCG 524 

TGAAACTCTT TGAAAAGTGG GGCTGGAAGA AGATTGCTAC CATCCAGCAG ACCACTGAGG 584 

TCTTCACTTC GACTCTGGAC GACCTGGAGG AACGAGTGAA GGAGGCTGGA ATTGAGATTA 644 

CTTTCCGCCA GAGTTTCTTC TCAGATCCAG CTGTGCCCGT CAAAAACCTG AAGCGCCAGG 704 

ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC CCGGAAAGTT TTTTGTGAGG 7 64 

TGTACAAGGA GCGTCTCTTT GGGAAGAAGT ACGTCTGGTT CCTCATTGGG TGGTATGCTG 824 

ACAATTGGTT CAAGATCTAC GACCCTTCTA TCAACTGCAC AGTGGATGAG ATGACTGAGG 884 
CGGTGGAGGG CCACATCACA ACTGAGATTG TCATGCTGAA TCCTGCCAAT ACCCGCAGCA 944 

TTTCCAACAT GACATCCCAG GAATTTGTGG AGAAACTAAC CAAGCGACTG AAAAGACACC 1004 
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CTGAGGAGAC 



AGGAGGCTTC 



CAGGAGGCAC 



CGCTGGCCTA TGATGCCATC TGGGCCTTGG 



1064 



CACTGGCCCT GAACAAGACA TCTGGAGGAG GCGGCCGTTC TGGTGTGCGC CTGGAGGACT 1124 

TCAACTACAA CAACCAGACC ATTACCGACC AAATCTACCG GGCAATGAAC TCTTCGTCCT 1184 

TTGAGGGTGT CTCTGGCCAT GTGGTGTTTG ATGCCAGCGG CTCTCGGATG GCATGGACGC 1244 

TTATCGAGCA GCTTCAGGGT GGCAGCTACA AGAAGATTGG CTACTATGAC AGCACCAAGG 13 04 

ATGATCTTTC CTGGTCCAAA ACAGATAAAT GGATTGGAGG GTCCCCCCCA GCTGACCAGA 1364 

CCCTGGTCAT CAAGACATTC CGCTTCCTGT CACAGAAACT CTTTATCTCC GTCTCAGTTC 1424 

TCTCCAGCCT GGGCATTGTC CTAGCTGTTG TCTGTCTGTC CTTTAACATC TACAACTCAC 1484 

ATGTCCGTTA TATCCAGAAC TCACAGCCCA ACCTGAACAA CCTGACTGCT GTGGGCTGCT 1544 

CACTGGCTTT AGCTGCTGTC TTCCCCCTGG GGCTCGATGG TTACCACATT GGGAGGAACC 1604 

AGTTTCCTTT CGTCTGCCAG GCCCGCCTCT GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT 1664 

ACGGTTCCAT GTTCACCAAG ATTTGGTGGG TCCACACGGT CTTCACAAAG AAGGAAGAAA 1724 

AGAAGGAGTG GAGGAAGACT CTGGAACCCT GGAAGCTGTA TGCCACAGTG GGCCTGCTGG 1784 

TGGGCATGGA TGTCCTCACT CTCGCCATCT GGCAGATCGT GGACCCTCTG CACCGGACCA 1844 

TTGAGACATT TGCCAAGGAG GAACCTAAGG AAGATATTGA CGTCTCTATT CTGCCCCAGC 1904 

TGGAGCATTG CAGCTCCAGG AAGATGAATA CATGGCTTGG CATTTTCTAT GGTTACAAGG 1964 

GGCTGCTGCT GCTGCTGGGA ATCTTCCTTG CTTATGAGAC CAAGAGTGTG TCCACTGAGA 2024 

AGATCAATGA TCACCGGGCT GTGGGCATGG CTATCTACAA TGTGGCAGTC CTGTGCCTCA 2084 

TCACTGCTCC TGTCACCATG ATTCTGTCCA GCCAGCAGGA TGCAGCCTTT GCCTTTGCCT 2144 

CTCTTGCCAT AGTTTTCTCC TCCTATATCA CTCTTGTTGT GCTCTTTGTG CCCAAGATGC 2204., 

GCAGGCTGAT CACCCGAGGG GAATGGCAGT CGGAGGCGCA GGACACCATG AAGACAGGGT 2264 
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CATCGACCAA CAACAACGAG GAGGAGAAGT CCCGGCTGTT GGAGAAGGAG AACCGTGAAC 23"24 
TGGAAAAGAT CATTGCTGAG AAAGAGGAGC GTGTCTCTGA ACTGCGCCAT CAACTCCAGT 23 84 

5 CTCGGCAGCA GCTCCGCTCC CGGCGCCACC CACCGACACC CCCAGAACCC TCTGGGGGCC 2444 
TGCCCAGGGG ACCCCCTGAG CCCCCCGACC GGCTTAGCTG TGATGGGAGT CGAGTGCATT 2504 
TGCTTTATAA GTGA 2518 



to 



15 



20 



25 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO : 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 



Val 
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(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 679 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

10 <ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 
20 (A) NAME /KEY: CDS 

(B) LOCATION: 1. .2679 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

25 ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
30 Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 3 5 4 0 4 5 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 



40 



TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val 'Arg Lys 
65 70 75 80 

HI 865-1 WO SEQ 136 ~~ 
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TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

5 

GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT CCC ATG AGC GGG GGC 33 6 

Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe Pro Met Ser Gly Gly 
100 105 110 

10 TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 384 
Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
115 120 125 

GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 432 
15 Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr Glu Leu Lys Leu 
130 135 140 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 480 
He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
20 145 150 155 160 

TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC 528 
Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He Leu Met Pro Gly 
165 170 175 



25 



30 



35 



TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC 576 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
180 185 190 

CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG TCA AAC CGG 624 
Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
195 200 205 

CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC 672 
Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
210 215 220 



AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 720 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys He 
40 225 230 235 240 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG* GAC GAC 7 68 

Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 

HI 865-1 WOSEQ 137 
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10 



245 250 255 — 

CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 816 
Leu Glu Glu Arg Val Lys Glu Ala Gly. lie Glu He Thr Phe Arg Gin 
260 265 270 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 864 
Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
' 3 275 280 285 

GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 912 
Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
290 295 300 

15 GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 960 
Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
305 310 315 320 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 1008 
20 Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp 

325 330 335 

CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC 1056 
Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
25 340 345 350 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 1104 

His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
355 360 365 

30 

ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA 1152 

He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
370 375 380 

35 CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 1200 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
385 390 395 400 

GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 1248 
40 Ala Tyr Asp Ala He Trp Ala Leu Ala Leu. Ala Leu Asn Lys Thr Ser 

405 410 415 

GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 1296 
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Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
420 425 430 

AAC CAG ACC ATT ACC GAC CAA ATC TAG CGG GCA ATG AAC TCT TCG TCC 13 44 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
435 440 445 

TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 1392 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
450 455 460 

ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 1440 
Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
465 470 475 480 

ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 1488 
He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
485 490 495 

GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC 153 6 

Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
500 505 510 

AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT 1584 
Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He Ser Val Ser Val 
515 520 525 

CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC 1632 
Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
530 535 540 

ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG 1680 
He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
545 550 555 560 

AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC 1728 
Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
565 570 575 

CCC CTG GGG CTC GAT GGT TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC 1776 
Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Asn Gin Phe Pro Phe 
580 585 590 
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GTC TGC CAG GCC CGC CTC TGG CTC CTG GGC CTG GGC TTT AGT CTG GGC 
Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
595 600 605 



18-24 



TAC GGT TCC ATG TTC ACC AAG ATT TGG TGG GTC CAC ACG GTC TTC ACA 
Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 
610 615 620 



1872 



AAG AAG GAA GAA AAG AAG GAG TGG AGG AAG ACT CTG GAA CCC TGG AAG 
Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
625 630 635 640 



1920 



CTG TAT GCC ACA GTG GGC CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
645 650 655 



1968 



GCC ATC TGG CAG ATC GTG GAC CCT CTG CAC CGG ACC ATT GAG ACA TTT 
Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr lie Glu Thr Phe 
660 665 670 



2016 



GCC AAG GAG GAA CCT AAG GAA GAT ATT GAC GTC TCT ATT CTG CCC CAG 
Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser He Leu Pro Gin 
675 680 685 



2064 



CTG GAG CAT TGC AGC TCC AGG AAG ATG AAT ACA TGG CTT GGC ATT TTC 
Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly He Phe 
690 695 700 



2112 



TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC TTC CTT GCT TAT 
Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
705 710 715 720 



2160 



GAG ACC AAG AGT GTG TCC ACT GAG AAG ATC AAT GAT CAC CGG GCT GTG 
Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
725 730 735 



2208 



GGC ATG GCT ATC TAC AAT GTG GCA GTC CTG TGC CTC ATC ACT GCT CCT 
Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
740 745 750 



2256 



GTC ACC ATG ATT CTG TCC AGC CAG CAG GAT GCA GCC TTT GCC TTT GCC 
Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala*Phe Ala 
755 760 765 



2304 
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TCT CTT GCC ATA GTT 
Ser Leu Ala lie Val 
770 

GTG CCC AAG ATG CGC 
Val Pro Lys Met Arg 
785 

GCG CAG GAC ACC ATG 
Ala Gin Asp Thr Met 
805 

GAG AAG TCC CGG CTG 
Glu Lys Ser Arg Leu 
820 

ATT GCT GAG AAA GAG 
lie Ala Glu Lys Glu 
835 

TCT CGG CAG CAG CTC 
Ser Arg Gin Gin Leu 
850 

CCC TCT GGG GGC CTG 
Pro Ser Gly Gly Leu 
865 

AGC TGT GAT GGG AGT 
Ser Cys Asp Gly Ser 
885 



TTC TCC TCC TAT 
Phe Ser Ser Tyr 
775 

AGG CTG ATC ACC 
Arg Leu lie Thr 
790 

AAG ACA GGG TCA 
Lys Thr Gly Ser 

TTG GAG AAG GAG 
Leu Glu Lys Glu 
825 

GAG CGT GTC TCT 
Glu Arg Val Ser 
840 

CGC TCC CGG CGC 
Arg Ser Arg Arg 
855 

CCC AGG GGA CCC 
Pro Arg Gly Pro 
870 

CGA GTG CAT TTG 
Arg Val His Leu 



ATC ACT CTT GTT GTG 
lie Thr Leu Val Val 
780 

CGA GGG GAA TGG CAG 
Arg Gly Glu Trp Gin 
795 

TCG ACC AAC AAC AAC 
Ser Thr Asn Asn Asn 
810 

AAC CGT GAA CTG GAA 
Asn Arg Glu Leu Glu 
830 

GAA CTG CGC CAT CAA 
Glu Leu Arg His Gin 
845 

CAC CCA CCG ACA CCC 
His Pro Pro Thr Pro 
860 

CCT GAG CCC CCC GAC 
Pro Glu Pro Pro Asp 
875 

CTT TAT AAG TGA 
Leu Tyr Lys * 
890 



CTC TTT 23 52 

Leu Phe 

TCG GAG 2400 
Ser Glu 
800 

GAG GAG 2448 

Glu Glu 

815 

AAG ATC 249 6 

Lys lie 

CTC CAG 2544 
Leu Gin 

CCA GAA 2592 
Pro Glu 

CGG CTT 2640 
Arg Leu 
880 

2679 



(2) INFORMATION FOR SEQ ID NO : 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 892 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



H1865-1 WO SEQ 141 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: " — 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

5 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
10 35 , 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

15 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

20 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
100 105 110 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
25 115 120 125 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
130 135 140 

30 lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
145 150 155 160 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
165 170 175 

35 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
180 185 190 

Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
40 195 200 205 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr' Leu His 
210 215 220 

HI 865-1 WO SEQ 142 

SUBSTITUTE SHEET (RULE 26) 
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Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
225 230 235 240 

Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
245 250 255 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
260 265 270 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
275 280 285 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
290 295 300 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
305 310 315 320 

Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 
325 330 335 

Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
340 345 350 

His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
355 360 365 

lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
370 375 380 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
385 390 395 400 

Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser- 
405 410 415 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
420 425 430 

Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
435 440 445 



H1865-1 WOSEQ 143 
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Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
450 455 460 

Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
5 465 470 475 480 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
485 490 495 

10 Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
500 505 510 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He Ser Val Ser Val 
515 520 525 

15 

Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
530 535 540 

He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
20 545 550 555 560 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
565 570 575 

25 Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Asn Gin Phe Pro Phe 
580 585 590 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
595 600 605 

30 

Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His Thr Val Phe Thr 
610 615 620 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
35 625 630 635 640- 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
645 650 655 

40 Ala He Trp Gin lie Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
660 665 670 

Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
HI 865-1 WO SEQ 144 
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675 680 685 

Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly lie Phe 
690 695 700 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr 
705 710 715 720 

Glu-OThr Lys Ser Val Ser Thr Glu Lys lie Asn Asp His Arg Ala Val 
725 730 735 

Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu lie Thr Ala Pro 
740 745 750 

Val Thr Met lie Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
755 760 765 

Ser Leu Ala lie Val Phe Ser Ser Tyr lie Thr Leu Val Val Leu Phe 
770 775 780 

Val Pro Lys Met Arg Arg Leu lie Thr Arg Gly Glu Trp Gin Ser Glu 
785 790 795 800 

Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
805 810 815 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys lie 
820 825 830 

lie Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
835 840 845 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
850 855 860 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
865 870 875 880 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
885 890 



(2) INFORMATION FOR SEQ ID NO: 76: 

HI 865-1 WO SEQ 145 

SUBSTITUTE SHEET (RULE 26) 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2661 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rnRNA 

!0 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION : 1 . .2661 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
25 1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

35 GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG- 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
40 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 

H1865-1 WO SEQ 146 
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Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys ~ 
85 90 95 

GTG AAT CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAG ATC GGG GCA 33 6 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala 
100 105 HO 

CTG TTT CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG 3 84 

Leu Phe Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg 
115 120 125 

GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG 43 2 

Asp He Leu Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys 
130 135 140 

TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC 480 
Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn 
145 150 155 160 

GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG 528 
Asp Pro He Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr 
165 170 175 

CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT 576 
Leu Val Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr 
180 185 190 

GGC TCC AGC TCA CCA GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC 624 
Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe 
195 200 205 

TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA 672 
Phe Arg Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys 
210 215 220 

CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC 720 
Leu Phe Glu Lys Trp Gly Trp Lys Lys He Ala Thr He Gin Gin Thr 
225 230 235 240 

ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG 7 68 

Thr Glu Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys 
245 250 *255 
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GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA 8TB 
Glu Ala Gly lie Glu lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro 
260 265 270 

5 GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG 864 
Ala Val Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie lie Val 
275 280 285 

GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC 912 
10 Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr 
290 295 300 

AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG 960 
Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp 
15 305 310 315 320 

TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC ACA 1008 

Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr 
325- 330 . 335 

20 

GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT 1056 

Val Asp Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He 
340 345 350 

25 GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC 1104 
Val Met Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser 
355 360 365 

CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG 1152 
30 Gin Glu Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu 
370 375 380 

GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG 1200 
Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp 
35 3 8 5 3 9 0 3 9 5 4 0 0^ 

GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT GGA GGA GGC GGC CGT TCT 1248 
Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser 
405 410 415 



40 



GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC 1296 

Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp 
420 425 430 

HI 865-1 WO SEQ 148 
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CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC 1344 
Gin He Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly 
435 440 445 

5 

CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC 13 92 

His Val Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He 
450 455 460 

10 GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC 1440 
Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser 
465 470 475 480 

ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG 1488 
15 Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly 

485 490 495 

TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG 153 6 

Ser Pro Pro Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu 
20 500 505 510 

TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT 1584 

Ser Gin Lys Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly He 
515 520 525 

25 

GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC 1632 
Val Leu Ala Val Val Cys Leu Ser Phe Asn He Tyr Asn Ser His Val 
530 535 540 

30 CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG 1680 
Arg Tyr He Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val 
545 550 555 560 

GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT 1728 
35 Gly Cys Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly- 

565 570 575 

TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC 1776 
Tyr His He Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu 
580 585 590 



40 



TGG CTC CTG GGC CTG GGC TTT AGT CTG GGC TAC GGT TCC ATG* TTC ACC 1824 
Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr 

HI 865-1 WO SEQ 149 
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595 600 605 



AAG ATT TGG TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG 1872 
Lys lie Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys 
5 610 615 620 



10 



GAG TGG AGG AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC 1920 
Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly 
625 630 635 640 

CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG 1968 
Leu Leu Val Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val 
645 650 655 



15 GAC CCT CTG CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG 2016 
Asp Pro Leu His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys 
660 665 670 



GAA GAT ATT GAC GTC TCT 
20 Glu Asp lie Asp Val Ser 
675 

AGG AAG ATG AAT ACA TGG 
Arg Lys Met Asn Thr Trp 
25 690 



ATT CTG CCC CAG CTG GAG 
lie Leu Pro Gin Leu Glu 
680 

CTT GGC ATT TTC TAT GGT 
Leu Gly lie Phe Tyr Gly 
695 700 



CAT TGC AGC TCC 2064 

His Cys Ser Ser 

685 

TAC AAG GGG CTG 2112 
Tyr Lys Gly Leu 



CTG CTG CTG CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC 2160 
Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser 
705 710 715 720 

ACT GAG AAG ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT 2208 
Thr Glu Lys lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn 
725 730 735 



35 GTG GCA GTC CTG TGC CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC* 2256 

Val Ala Val Leu Cys Leu lie Thr Ala Pro Val Thr Met lie Leu Ser 

740 745 750 

AGC CAG CAG GAT GCA GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC 2304 

40 Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala lie Val Phe „ 

755 760 765 

TCC TCC TAT ATC ACT CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG 2352 
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Ser Ser Tyr lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg 
770 775 780 

CTG ATC ACC CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG 2400 
Leu lie Thr Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys 
785 790 795 800 

ACA GGG TCA TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG 2448 
Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu 
805 810 815 

GAG AAG GAG AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG 2496 
Glu Lys Glu Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu 
820 825 830 

CGT GTC TCT GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC 2544 
Arg Val Ser Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg 
835 840 845 

TCC CGG CGC CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC 2592 
Ser Arg Arg His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro 
850 855 860 

AGG GGA CCC CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA 2640 
Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg 
865 870 875 880 

GTG CAT TTG CTT TAT AAG TGA 2661 
Val His Leu Leu Tyr Lys 
885 



(2) INFORMATION FOR SEQ ID NO : 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
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Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
5 20 25 30 

lie His Pro Pro Tip Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

10 Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

IS 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala 
20 100 105 110 

Leu Phe Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg 
115 120 125 

25 Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys 
130 135 140 

Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn 
145 150 155 160 

30 

Asp Pro lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr 
165 170 175 

Leu Val Ala Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr 
35 180 185 190 

Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe 
195 200 205 

40 Phe Arg Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys 
210 215 220 

Leu Phe Glu Lys Trp Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr 
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225 



230 235 240 



Thr Glu Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys 
245 250 255 

5 

Glu Ala Gly He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro 
260 265 270 

Ala^Val Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val 
10 275 280 285 

Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr 
290 295 300 

15 Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp 
305 310 315 320 

Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr 
32.5 330 335 

20 

Val Asp Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He 
340 345 350 

Val Met Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser 
25 355 360 365 

Gin Glu Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu 
370 375 380 

30 Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp 
385 390 395 400 

Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser 
405 410 415 

35 

Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp 
420 425 430 

Gin He Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly 
40 435 440 445 

His Val Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He 
450 455 460 
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Glu Gin Leu Gin 
465 

Thr Lys Asp Asp 



Ser Pro Pro Ala 
500 

Ser Gin Lys Leu 
515 

Val Leu Ala Val 
530 



Gly Gly Ser Tyr 
470 

Leu Ser Trp Ser 
485 

Asp Gin Thr Leu 



Phe lie Ser Val 
520 

Val Cys Leu Ser 
535 



Lys Lys lie Gly 
475 

Lys Thr Asp Lys 
490 

Val lie Lys Thr 
505 

Ser Val Leu Ser 



Phe Asn lie Tyr 
540 



Tyr Tyr Asp Ser 
480 

Trp lie Gly Gly 
495 

Phe Arg Phe Leu 
510 

Ser Leu Gly He 
525 

Asn Ser His Val 



Arg Tyr He Gin 
545 

Gly Cys Ser Leu 



Tyr His He Gly 
580 

Trp Leu Leu Gly 
595 

Lys He Trp Trp 
610 

Glu Trp Arg Lys 
625 

Leu Leu Val Gly 



Asp Pro Leu His 
660 

Glu Asp He Asp 
675 



Asn Ser Gin Pro 
550 

Ala Leu Ala Ala 
565 

Arg Asn Gin Phe 



Leu Gly Phe Ser 
600 

Val His Thr Val 
615 

Thr Leu Glu Pro 
630 

Met Asp Val Leu 
645 

Arg Thr He Glu 



Val Ser He Leu 
680 



Asn Leu Asn Asn 
555 

Val Phe Pro Leu 
570 

Pro Phe Val Cys 
585 

Leu Gly Tyr Gly 

Phe Thr Lys Lys 
620 

Trp Lys Leu Tyr 
635 

Thr Leu Ala He 
650 

Thr Phe Ala Lys 
665 

Pro Gin Leu Glu 



Leu Thr Ala Val 
560 

Gly Leu Asp Gly 
575 

Gin Ala Arg Leu 
590 

Ser Met Phe Thr 
605 

Glu Glu Lys Lys 



Ala Thr Val Gly 
640 

Trp Gin He Val- 
655 

Glu Glu Pro Lys 
670 

His Cys Ser Ser 
685 
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Arg Lys Met Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu 
690 695 700 

Leu Leu Leu Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser 
705 710 715 720 

Thr Glu Lys He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn 
725 730 735 

Val Ala Val Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser 
740 745 750 

Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe 
755 760 765 

Ser Ser Tyr He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg 
770 775 780 

Leu He Thr Arg Gly, Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys 
785 790 795 800 

Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu 
805 810 815 

Glu Lys Glu Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu 
820 825 830 

Arg Val Ser Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg 
835 840 845 

Ser Arg Arg His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro 
850 855 860 

Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg 
865 870 875 880* 

Val His Leu Leu Tyr Lys 
885 



(2) INFORMATION FOR SEQ ID NO: 78:' 
(i) SEQUENCE CHARACTERISTICS: 

H1865-1 WO SEQ 155 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

(A) LENGTH: 1692 base pairs — 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
15 (B) LOCATION: 1 . . 1692 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 78: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
20 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
25 20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

30 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

35 TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG- 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
40 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC ATT ACC GAC CAA ATC TAC 336 

HI 865-1 WO SEQ 156 



SUBSTITUTE SHEET (RULE 26) 



PCT/SE98/01947 

WO 99/21890 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr lie Thr Asp Gin lie Tyx 
100 105 HO 

CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG 3 84 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
115 120 125 

TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT 432 
Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 
130 135 140 

CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT 480 
Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp 
145 150 155 160 

GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA 528 
Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro 
165 170 175 

GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA 57 6 

Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys 
180 185 190 

CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT 624 
Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu Ala 
195 200 205 

GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC 672 
Val Val Cys Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He 
210 215 220 

CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA 720 
Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
225 230 235 240 

CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT TAC CAC ATT 768 
Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His He 
245 250 255 



GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC TGG CTC CTG 
Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp ^ Leu Leu 
260 265 270* 



816 ; , 



H1865-1 WO SEQ 157 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



GGC CTG GGC TTT AGT CTG GGC TAC GGT TCC ATG TTC ACC AAG ATT TGG 81T4 
Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
275 280 285 

5 TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG GAG TGG AGG 912 
Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
290 295 300 

AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC CTG CTG GTG 960 
10 Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
305 310 315 320 

GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG GAC CCT CTG 1008 
Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu 
15 325 330 335 

CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG GAA GAT ATT 1056 
His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie 
340 345 350 

20 

GAC GTC TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC AGG AAG ATG 1104 
Asp Val Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
355 360 365 

25 AAT ACA TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG 1152 
Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
370 375 380 

CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC ACT GAG AAG 1200 
30 Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
385 390 395 400 

ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT GTG GCA GTC 1248 
lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val 
35 405 410 415 

CTG TGC CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC AGC CAG CAG 1296 
Leu Cys Leu lie Thr Ala Pro Val Thr Met lie Leu Ser Ser Gin Gin 
420 425 430 



40 



GAT GCA GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC TCC TCC TAT 13 44 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala lie Val Phe Ser Ser Tyr 
435 440 445 

HI 865-1 WO SEQ 158 

SUBSTITUTE SHEET (RULE 26) 
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ATC ACT CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC 1392 
He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr 
450 455 460 

CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA 1440 
Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
465 470 475 480 

TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG 1488 
Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
485 490 495 

AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT 153 6 

Asn Arg Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser 
500 505 510 

GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC 1584 
Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
515 520 525 

CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC 1632 
His Pro Pro -Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
530 535 540 

CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG 1680 
Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 
545 550 555 560 



CTT TAT AAG TGA 
Leu Tyr Lys * 



i692 



(2) INFORMATION FOR SEQ ID NO : 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



H1865-1 WO SEQ 159 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



PCT/SE98/01947 



Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

5 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
10 35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

15 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 . 95 

20 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr He Thr Asp Gin He Tyr 
100 105 110 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
25 115 120 125 

Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu 
130 135 140 

30 Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp 
145 150 155 160 

Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro 
165 170 175 

35 

Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys 
180 185 190 

Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu Ala 
40 195 200 205 

Val Val Cys Leu Ser Phe Asn He Tyr Asn Ser His Val Arg* Tyr He 
210 215 220 



HI 865-1 WO SEQ 
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Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
225 230 235 240 

Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
245 250 255 

Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
1 260 265 270 

Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
275 280 285 

Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
290 295 300 

Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
305 310 315 320 

Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu 
325 330 335 

His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie 
340 345 350 

Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
355 360 365 

Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
370 375 380 

Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
385' 390 395 400 

He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala VaL 
405 410 415 

Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin 
420 425 430 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr 
435 440 445 



H1865-1 WOSEQ 161 
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lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie Thr — 
450 455 460 

Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
5 465 470 475 480 

Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
485 490 495 

10 Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser 
500 505 510 

Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
515 520 525 

15 

His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
530 535 540 

Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 
20 545 550 555 560 

Leu Tyr Lys 



25 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2602 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION : 1 . .318 



H1865-1 WO SEQ 162 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80; 



48 



96 



144 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 
He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAT 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Asp 
50 55 60 

GAA TCG AAC GCC ACA CTC AGA ACG GCG CGC AGT GTA CAT CGG GGC ACT 240 
Glu Ser Asn Ala Thr Leu Arg Thr Ala Arg Ser Val His Arg Gly Thr 
65 70 75 80 

GTT TCC CAT GAG CGG GGG CTG GCC AGG GGG CCA GGC CTG CCA GCC CGC 288 
Val Ser His Glu Arg Gly Leu Ala Arg Gly Pro Gly Leu Pro Ala Arg 
85 90 95 

GGT GGA GAT GGC GCT GGA GGA CGT GAA TAG CCGCAGGGAC ATCCTGCCGG 33 8 

Gly Gly Asp Gly Ala Gly Gly Arg Glu * 
100 105 

ACTATGAGCT CAAGCTCATC CACCACGACA GCAAGTGTGA TCCAGGCCAA GCCACCAAGT 398 

ACCTATATGA GCTGCTCTAC AACGACCCTA TCAAGATCAT CCTTATGCCT GGCTGCAGCT 458 

CTGTCTCCAC GCTGGTGGCT GAGGCTGCTA GGATGTGGAA CCTCATTGTG CTTTCCTATG 518 

GCTCCAGCTC ACCAGCCCTG TCAAACCGGC AGCGTTTCCC CACTTTCTTC CGAACGCACC 

CATCAGCCAC ACTCCACAAC CCTACCCGCG TGAAACTCTT TGAAAAGTGG GGCTGGAAGA 63 8 

AGATTGCTAC CATCCAGCAG ACCACTGAGG TCTTCACTTC GACTCTGGAC GACCTGGAGG 



578 



698 



HI 865-1 WO SEQ 
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SUBSTITUTE SHEET (RULE 26) 
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AACGAGTGAA GGAGGCTGGA ATTGAGATTA CTTTCCGCCA GAGTTTCTTC TC AGATCCAG TSS 

CTGTGCCCGT CAAAAACCTG AAGCGCCAGG ATGCCCGAAT CATCGTGGGA CTTTTCTATG 818 

AGACTGAAGC CCGGAAAGTT TTTTGTGAGG TGTACAAGGA GCGTCTCTTT GGGAAGAAGT 878 
ACGTCTGGTT CCTCATTGGG TGGTATGCTG ACAATTGGTT CAAGATCTAC GACCCTTCTA . 938 

TCAACTGCAC AGTGGATGAG ATGACTGAGG CGGTGGAGGG CCACATCACA ACTGAGATTG 998 

TCATGCTGAA TCCTGCCAAT ACCCGCAGCA TTTCCAACAT GACATCCCAG GAATTTGTGG 1058 

AGAAACTAAC CAAGCGACTG AAAAGACACC CTGAGGAGAC AGGAGGCTTC CAGGAGGCAC 1118 

CGCTGGCCTA TGATGCCATC TGGGCCTTGG CACTGGCCCT GAACAAGACA TCTGGAGGAG 1178 

GCGGCCGTTC TGGTGTGCGC CTGGAGGACT TCAACTACAA CAACCAGACC ATTACCGACC 123 8 

AAATCTACCG GGCAATGAAC TCTTCGTCCT TTGAGGGTGT CTCTGGCCAT GTGGTGTTTG 1298 

ATGCCAGCGG CTCTCGGATG GCATGGACGC TTATCGAGCA GCTTCAGGGT GGCAGCTACA 1358 

AGAAGATTGG CTACTATGAC AGCACCAAGG ATGATCTTTC CTGGTCCAAA ACAGATAAAT 1418 

GGATTGGAGG GTCCCCCCCA GCTGACCAGA CCCTGGTCAT CAAGACATTC CGCTTCCTGT 1478 

CACAGAAACT CTTTATCTCC GTCTCAGTTC TCTCCAGCCT GGGCATTGTC CTAGCTGTTG 1538 

TCTGTCTGTC CTTTAACATC TACAACTCAC ATGTCCGTTA TATCCAGAAC TCACAGCCCA 1598 

ACCTGAACAA CCTGACTGCT GTGGGCTGCT CACTGGCTTT AGCTGCTGTC TTCCCCCTGG 1658 

GGCTCGATGG TTACCACATT GGGAGGAACC AGTTTCCTTT CGTCTGCCAG GCCCGCCTCT 1718 

GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT ACGGTTCCAT GTTCACCAAG ATTTGGTGGG 1778 

TCCACACGGT CTTCACAAAG AAGGAAGAAA AGAAGGAGTG GAGGAAGACT CTGGAACCCT 183 8 

GGAAGCTGTA TGCCACAGTG GGCCTGCTGG TGGGCATGGA TGTCCTCACT CTCGCCATCT 1898 

GGCAGATCGT GGACCCTCTG CACCGGACCA TTGAGACATT TGCCAAGGAG GAACCTAAGG 1958 

AAGATATTGA CGTCTCTATT CTGCCCCAGC TGGAGCATTG CAGCTCCAGG AAGATGAATA 2018 

HI 865-1 WOSEQ 164 ~" 
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CATGGCTTGG CATTTTCTAT GGTTACAAGG GGCTGCTGCT GCTGCTGGGA ATCTTCCTTG 2078 

CTTATGAGAC CAAGAGTGTG TCCACTGAGA AGATCAATGA TCACCGGGCT GTGGGCATGG 213 8 

CTATCTACAA TGTGGCAGTC CTGTGCCTCA TCACTGCTCC TGTCACCATG ATTCTGTCCA 2198 

GCCAGCAGGA TGCAGCCTTT GCCTTTGCCT CTCTTGCCAT AGTTTTCTCC TCCTATATCA 2258 

CTCTTGTTGT GCTCTTTGTG CCCAAGATGC GCAGGCTGAT CACCCGAGGG GAATGGCAGT 2318 

CGGAGGCGCA GGACACCATG AAGACAGGGT CATCGACCAA CAACAACGAG GAGGAGAAGT 2378 

CCCGGCTGTT GGAGAAGGAG AACCGTGAAC TGGAAAAGAT CATTGCTGAG AAAGAGGAGC 243 8 

GTGTCTCTGA ACTGCGCCAT CAACTCCAGT CTCGGCAGCA GCTCCGCTCC CGGCGCCACC 249 8 

CACCGACACC CCCAGAACCC TCTGGGGGCC TGCCCAGGGG ACCCCCTGAG CCCCCCGACC 2558 

GGCTTAGCTG TGATGGGAGT CGAGTGCATT TGCTTTATAA GTGA 2602 

(2) INFORMATION FOR SEQ ID NO : 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 81: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Asp 

HI 865-1 WO SEQ 165 
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50 55 60 

Glu Ser Asn Ala Thx Leu Arg Thr Ala Arg Ser Val His Arg Gly Thr 
65 70 75 80 

5 

Val Ser His Glu Arg Gly Leu Ala Arg Gly Pro Gly Leu Pro Ala Arg 
85 90 95 

Gly Gly Asp Gly Ala Gly Gly Arg Glu 
10 100 , 105 



(2) INFORMATION FOR SEQ ID NO: 82: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
30 (B) LOCATION : 1 . .294 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
35 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly, 
15 10 15 

* GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
40 20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG 'ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 

HI 865-1 WO SEQ 166 
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WO 99/21890 PCT/SE98/01947 

35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
5 50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

10 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

15 GTG TGA TCCAGGCCAA GCCACCAAGT ACCTATATGA GCTGCTCTAC AACGACCCTA 344 
Val * 



TCAAGATCAT CCTTATGCCT GGCTGCAGCT CTGTCTCCAC GCTGGTGGCT GAGGCTGCTA 404 

GGATGTGGAA CCTCATTGTG CTTTCCTATG GCTCCAGCTC ACCAGCCCTG TCAAACCGGC 464 

AGCGTTTCCC CACTTTCTTC CGAACGCACC CATCAGCCAC ACTCCACAAC CCTACCCGCG 524 

25 TGAAACTCTT TGAAAAGTGG GGCTGGAAGA AGATTGCTAC CATCCAGCAG ACCACTGAGG 584 

TCTTCACTTC GACTCTGGAC GACCTGGAGG AACGAGTGAA GGAGGCTGGA ATTGAGATTA 644 

CTTTCCGCCA G AGTTTC TTC TCAGATCCAG CTGTGCCCGT CAAAAACCTG AAGCGCCAGG 704 

ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC CCGGAAAGTT TTTTGTGAGG 764 

TGTACAAGGA GCGTCTCTTT GGGAAGAAGT ACGTCTGGTT CCTCATTGGG TGGTATGCTG 824 

35 ACAATTGGTT CAAGATCTAC GACCCTTCTA TCAACTGCAC AGTGGATGAG ATGACTQAGG 8 84 

CGGTGGAGGG CCACATCACA ACTGAGATTG TCATGCTGAA TCCTGCCAAT ACCCGCAGCA 944 

TTTCCAACAT GACATCCCAG GAATTTGTGG AGAAACTAAC CAAGCGACTG AAAAGACACC 1004 

CTGAGGAGAC AGGAGGCTTC CAGGAGGCAC CGCTGGCCTA TGATGCCATC TGGGCCTTGG 1064 

CACTGGCCCT GAACAAGACA TCTGGAGGAG GCGGCCGTTC TGGTGTGCGC CTGGAGGACT 1124 



30 



40 
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TCAACTACAA CAACCAGACC ATT AC C G AC C AAATCTACCG GGCAATGAAC TCTTCGTCCT 1184 

TTGAGGGTGT CTCTGGCCAT GTGGTGTTTG ATGCCAGCGG CTCTCGGATG GCATGGACGC 1244 

5 

TTATCGAGCA GCTTCAGGGT GGCAGCTACA AGAAGATTGG CTACTATGAC AGCACCAAGG 1304 

ATGATCTTTC CTGGTCCAAA ACAGATAAAT GGATTGTTAT ATCCAGAACT CACAGCCCAA 13 64 

10 CCTGAACAAC CTGACTGCTG TGGGCTGCTC ACTGGCTTTA GCTGCTGTCT TCCCCCTGGG 1424 

GCTCGATGGT TACCACATTG GGAGGAACCA GTTTCCTTTC GTCTGCCAGG CCCGCCTCTG 1484 

GCTCC TGGGC CTGGGCTTTA GTCTGGGCTA CGGTTCCATG TTCACCAAGA TTTGGTGGGT 1544 

15 

CCACACGGTC TTCACAAAGA AGGAAGAAAA GAAGGAGTGG AGGAAGACTC TGGAACCCTG 1604 

GAAGCTGTAT GCCACAGTGG GCCTGCTGGT GGGCATGGAT GTCCTCACTC TCGCCATCTG 1664 

20 GCAGATCGTG GACCCTCTGC ACCGGACCAT TGAGACATTT GCCAAGGAGG AACCTAAGGA 1724 

AGATATTGAC GTCTCTATTC TGCCCCAGCT GGAGCATTGC AGCTCCAGGA AGATGAATAC 1784 

ATGGCTTGGC ATTTTCTATG GTTACAAGGG GCTGCTGCTG CTGCTGGGAA TCTTCCTTGC 1844 

25 

TTATGAGACC AAGAGTGTGT CCACTGAGAA GATCAATGAT CACCGGGCTG TGGGCATGGC 1904 

TATCTACAAT GTGGCAGTCC TGTGCCTCAT CACTGCTCCT GTCACCATGA TTCTGTCCAG 1964 

30 CCAGCAGGAT GCAGCCTTTG CCTTTGCCTC TCTTGCCATA GTTTTCTCCT CCTATATCAC 2024 

TCTTGTTGTG CTCTTTGTGC CCAAGATGCG CAGGCTGATC ACCCGAGGGG AATGGCAGTC 2084 

GGAGGCGCAG GACACCATGA AGACAGGGTC ATCGACCAAC AACAACGAGG AGGAGAAGTC 2144 

35 

CCGGCTGTTG GAGAAGGAGA ACCGTGAACT GGAAAAGATC ATTGCTGAGA AAGAGGAGCG 2204 

TGTCTCTGAA CTGCGCCATC AACTCCAGTC TCGGCAGCAG CTCCGCTCCC GGCGCCACCC 22 64 

40 ACCGACACCC CCAGAACCCT CTGGGGGCCT GCCCAGGGGA CCCCCTGAGC CCCCCGACCG 2324., 

GCTTAGCTGT GATGGGAGTC GAGTGCATTT GCTTTATAAG TGA * 23 67 
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(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



~ (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 

He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 



Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 

65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 



Val 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 
10 (A) NAME/KEY:. CDS 

<B) LOCATION: 1 . .1491 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 84: 



15 



20 



30 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 



ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
25 3 5 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
5 0 55 60 



TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

35 TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT - 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 33 6 

40 Val Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 HO 

TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 384 
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Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp ™ 
115 120 125 



TTC CGG TGT GAC 
5 Phe Arg Cys Asp 
130 

AAG TAG CTA TAT 
Lys Tyr Leu Tyr 
10 145 

ATG CCT GGC TGC 
Met Pro Gly Cys 

15 

ATG TGG AAC CTC 
Met Trp Asn Leu 
180 

20 TCA AAC CGG CAG 
Ser Asn Arg Gin 
195 

ACA CTC CAC AAC 
25 Thr Leu His Asn 
210 

AAG AAG ATT GCT 
Lys Lys lie Ala 
30 2 2 5 

CTG GAC GAC CTG 
Leu Asp Asp Leu 

35 

TTC CGC CAG AGT 
Phe Arg Gin Ser 
260 

40 AAG CGC CAG GAT 
Lys Arg Gin Asp 
275 



CCC GAC TTC CAT CTG 
Pro Asp Phe His Leu 
135 

GAG CTG CTC TAC AAC 
Glu Leu Leu Tyr Asn 
150 

AGC TCT GTC TCC ACG 
Ser Ser Val Ser Thr 
165 

ATT GTG CTT TCC TAT 
lie Val Leu Ser Tyr 
185 

CGT TTC CCC ACT TTC 
Arg Phe Pro 'Thr Phe 
200 

CCT ACC CGC GTG AAA 
Pro Thr Arg Val Lys 
215 

ACC ATC CAG CAG ACC 
Thr lie Gin Gin Thr 
230 

GAG GAA CGA GTG AAG 
Glu Glu Arg Val Lys 
245 

TTC TTC TCA GAT CCA 
Phe Phe Ser Asp Pro 
265 

GCC CGA ATC ATC GTG 
Ala Arg lie lie Val, 
280 



TGT GAT CCA GGC CAA 
Cys Asp Pro Gly Gin 
140 

GAC CCT ATC AAG ATC 
Asp Pro lie Lys lie 
155 

CTG GTG GCT GAG GCT 
Leu Val Ala Glu Ala 
170 

GGC TCC AGC TCA CCA 
Gly Ser Ser Ser Pro 
190 

TTC CGA ACG CAC CCA 
Phe Arg Thr His Pro 
205 

CTC TTT GAA AAG TGG 
Leu Phe Glu Lys Trp 
220 

ACT GAG GTC TTC ACT 
Thr Glu Val Phe Thr 
235 

GAG GCT GGA ATT GAG 
Glu Ala Gly lie Glu 
250 

GCT GTG CCC GTC AAA 
Ala Val Pro Val Lys 
270 

GGA CTT TTC TAT GAG 
Gly Leu Phe Tyr Glu 
285 



GCC ACC 432 
Ala Thr 

ATC CTT 480 
lie Leu 
160 

GCT AGG 528 

Ala Arg 

175 

GCC CTG 576 
Ala Leu 

TCA GCC 624 
Ser Ala 

GGC TGG 672 
Gly Trp 

TCG ACT 720 
Ser Thr 
240 

ATT ACT 768 

He Thr 

255 

AAC CTG 816 
Asn Leu 

ACT GAA 864 „ 

Thr Glu 
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GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG 9^2 
Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys 
290 295 300 

5 AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG 960 
Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys 
305 310 315 320 

ATC TAC GAC CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG 1008 
10 lie Tyr Asp Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala 

325 330 335 

GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT 1056 
Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn 
15 340 345 350 

ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA 1104 
Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu 
355 360 365 

20 

ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG 1152 
Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu 
370 375 380 

25 GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC 1200 
Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn 
385 390 395 400 

AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC 1248 
30 Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe 

405 410 415 

AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC 1296 
Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn 
35 420 425 430 

TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC 13 44 

Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser 
435 440 445 



40 



GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC 1392 
Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly" Gly Ser 
450 455 460 
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TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG 
Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp 
465 470 475 480 



1440 



TCC AAA ACA GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC CCA ACC 
Ser Lys Thr Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr 
485 490 495 



1488 



10 TGA ACAACCTGAC TGCTGTGGGC TGCTCACTGG CTTTAGCTGC TGTCTTCCCC 



1541 



15 



CTGGGGCTCG ATGGTT AC C A CATTGGGAGG AACCAGTTTC CTTTCGTCTG CCAGGCCCGC 1601 
CTCTGGCTCC TGGGCCTGGG CTTTAGTCTG GGCTACGGTT CCATGTTCAC CAAGATTTGG 1661 
TGGGTCCACA CGGTCTTCAC AAAGAAGGAA GAAAAGAAGG AGTGGAGGAA GACTCTGGAA 1721 



20 



CCCTGGAAGC TGTATGCCAC AGTGGGC CTG CTGGTGGGCA TGGATGTCCT CACTCTCGCC 1781 



25 



ATCTGGCAGA TCGTGGACCC TCTGCACCGG ACCATTGAGA CATTTGCCAA GGAGGAACCT 1841 

AAGGAAGATA TTGACGTCTC TATTCTGCCC CAGCTGGAGC ATTGCAGCTC CAGGAAGATG 1901 

AATACATGGC TTGGCATTTT CTATGGTTAC AAGGGGCTGC TGCTGCTGCT GGGAATCTTC 1961 

CTTGCTTATG AGACCAAGAG TGTGTCCACT GAGAAGATCA ATGATCACCG GGCTGTGGGC 2021 



30 



ATGGCTATCT ACAATGTGGC AGTCCTGTGC CTCATCACTG CTCCTGTCAC CATGATTCTG 2081 



35 



TCCAGCCAGC AGGATGCAGC CTTTGCCTTT GCCTCTCTTG CCATAGTTTT CTCCTCCTAT 2141 

ATCACTCTTG TTGTGCTCTT TGTGCCCAAG ATGCGCAGGC TGATCACCCG AGGGGAATGG 2201 

CAGTCGGAGG CGCAGGACAC CATGAAGACA GGGTCATCGA CCAACAACAA CGAGGAGGAG 2261 

AAGTCCCGGC TGTTGGAGAA GGAGAACCGT GAACTGGAAA AGATCATTGC TGAGAAAGAG 2321 



40 



GAGCGTGTCT CTGAACTGCG CCATCAACTC CAGTCTCGGC AGCAGCTCCG CTCCCGGCGC 23 81 y 



CACCCACCGA CACCCCCAGA ACCCTCTGGG GGCCTGCCCA GGGGACCCCC TGAGCCCCCC 2441 
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GACCGGCTTA GCTGTGATGG GAGTCGAGTG CATTTGCTTT ATAAGTGA 2469 



(2) INFORMATION FOR SEQ ID NO: 85: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 

10 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

15 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

20 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
25 50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

30 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

35 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

Phe Arg Cys Asp Pro Asp Phe His Leu Cys Asp Pro Gly Gin Ala Thr 
40 130 135 140 

Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu 
145 150 155 160 
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Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg 
165 170 175 

5 Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu 
180 185 190 

Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala 
195 200 205 

10 

Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp 
210 215 220 

Lys Lys He Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser Thr 
15 225 230 235 240 

Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu lie Thr 
245 250 255 

20 Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu 
260 265 270 

Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr Glu 
275 280 285 

25 

Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys 
290 295 300 

Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys 
30 305 310 315 320 

He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala 
325 330 335 

35 Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn 
340 345 350 

Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu 
355 360 365 

40 

Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu 
370 375 380 
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Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Ash 
385 390 395 400 

Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe 
5 405 410 415 

Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn 
420 425 430 

10 Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser 
435 440 445 

Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser 
450 455 460 

15 

Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp 
465 470 475 480 

Ser Lys Thr Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr 
20 485 490 495 
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